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Disclosed herein are constitutively activated, non-endogenous versions of endogenous human G protein-coupled receptors comprising 
(a) the following amino acid sequence region (C-tcrminus to N-tcrminus orientation) and/or (b) the following nucleic acid sequence 
region (3 to 5 orientetion) transvcrsing the transmembrane-6 (TM6) and intracellular loop-3 aC3) regions of the GPCR: (a) P' AAis 
X and/or (b) P^" (AA-codon),5 Xcodan, respectively. In a most preferred embodiment. P» and P«>don are endogenous proline and an 
endogenous nucleic acid encoding region encoding proline, respectively, located within TM6 of the non^ndogenous GPCR- AAk and 
(AA-codon),5 are 15 endogenous ammo acid residues and 15 codons encoding endogenous amino acid residues, respectively and X and 
Xcodon are non-endogenous lysine and a non^ndogenous nucleic acid encoding region encoding lysine, respectively, located' within IC3 
of the non-endogenous GPCR. Because it is most preferred that the non-endogenous human GPCRs which incorporate these mutations 
arc incorporated mto mammalian cells and utilized for the screening of the candidate compounds, the non-endoVcnous human GPCR 
mcorporating the mutation need not be purified and isolated per se (i.e.. these are incorporated within the cellular membrane of a mammalian 
cell), although such purified and isolated non^ndogenous human GPCRs are well within the purview of this disclosure 
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NON-ENDOGENOUS, CONSTITUTIVELY ACTIVATED 
HUMAN G PROTEIN-COUPLED RECEPTORS 

The benefits of commonly owned U.S. Serial Number 09/170,496, filed 
October 13, 1998, U.S. Serial Number 08/839, 449 filed April 14, 1997 (now abandoned), 
5 U.S. Serial Number 09/060,188, filed April 14, 1998; U.S. Provisional Number 60/090,783, 
filed June 26, 1998; and U.S. Provisional Number 60/095,677, filed on August 7, 1998, are 
hereby claimed. Each of the foregoing applications are incorporated by reference herein in 
their entirety. 

FIELD OF THE INVENTION 

5 The invention disclosed in this patent document relates to transmembrane 

receptors, and more particularly to human G protein-coupled receptors (GPCRs) which have 
been altered such that altered GPCRs are constitutively activated. Most preferably, the altered 
human GPCRs are used for the screening of therapeutic compounds. 
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BACKGROUND OF THE INVENTION 

Although a number of receptor classes exist in humans, by far the most abundant and 
therapeuticaUy relevant is represented by the Gprotein-coupled receptor (GPCR or GPCRs) class. 
It is estimated that there are some 100,000 genes within the human genome, and of these, 
5 approximately 2% or 2,000 genes, are estimated to code for GPCRs. Of these, there are 
approximately 100 GPCRs for which the endogenous Ugand fliat binds to the GPCR has been 
identified. Because of the significant time-lag that exists between the discovery of an endogenous 
GPCR and its endogenous ligand, it can be presumed that the remaining 1,900 GPCRs will be 
identified and characterized long before the endogenous Ugands for these receptors are identified. 
10 Indeed, therapiditybywhichtheHumanGenomeProjectis sequencingthe 100.000 human genes 
indicates that the remaining human GPCRs will be fiilly sequenced within the next few years. 
Nevertheless, and despite the efforts to sequence the human genome, it is still very unclear as to 
how scientists will be able to rapidly, effectively and efficiently exploit this infomiation to 
improve and enhance the human conditioa The present invention is geared towaixls this 
1 5 important objective. 

Receptors, including GPCRs, for which the endogenous ligand has been identified are 
refeired to as "known" receptors, while receptors for which the endogenous ligand has not been 
identified are refeired to as "orphan" receptors. Hiis distinction is not merely semantic, 
particularly in the case of GPCRs. GPCRs represent an important area for the development of 
!0 phamiaceutical products: fiom approximately 20 of the 100 known GPCRs, 60% of all 
prescription phamiaceuticals have been developed. Thus, the orphan GPCRs are to the 
pharmaceutical industry what gold was to California in the late 19^ century - an opportunity to 
drive growth, expansion, enhancement and development. A serious drawback exists, however. 
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with orphan receptors relative to the discovery of novel therapeutics. This is because the 
traditional approach to the discovery and development of phannaceuticals has required access to 
both the receptor and its endogenous ligand. Thus, heretofore, orphan GPCRs have presented the 
art with a tantalizing and undeveloped resource for the discovery of pharmaceuticals. 
5 Under the traditional ^roach to the discovery of potential therapeutics, it is generally the 

case that the receptor is first identified. Before dmg discovery efforts can be initiated, elaborate, 
time consuming and expensive procedures are typically put into place in order to identify, isolate 
and generate the receptor^s endogenous ligand - this process can require fi^m between 3 and ten 
years per receptor, at a cost of about SSmillion (U.S.) per receptor. These time and financial 
10 resources must be expended before the traditional approach to drug discovery can commence. 
This is because traditional drug discovery techniques rely upon so-called "competitive binding 
assays" whereby putative therapeutic agents are "screened" against the receptor in an effort to 
discover compounds that either block the endogenous ligand fi^m binding to the receptor 
("antagonists"), or enhance or mimic the effects of the ligand binding to the receptor ("agonists"). 
15 The overall objective is to identify compounds that prevent cellular activation when the ligand 
binds to the receptor (the antagonists), or that enhance or increase cellular activity that would 
otherwise occur if the ligand was properly binding with the receptor (the agonists). Because the 
endogenous ligands for orphan GPCRs are by definition not identified, the ability to discover novel 
and unique therapeutics to these receptors using traditional dmg discovery techniques is not 
20 possible. The present invention, as will be set forth in greater detail below, overcomes these and 
other severe limitations created by such traditional drug discovery techniques. 

GPCRs share a common structural motif. All these receptors have seven sequences of 
between 22 to 24 hydrophobic amino acids that form seven alpha helices, each of which spans the 
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membrane (each span is identified by number, Le., transmembrane- 1 (TM-1), transmebrane-2 
(TM-2), etc.). The transmembrane heUces are joined by strands of amino acids between 
transmembrane-2 and transmembrane-3, transmembrane-4 and transmembrane-5, and 
transmembrane-6 and transmembrane-? on the exterior, or "extracellular" side, of the cell 
5 membrane (these are referred to as "extracellular** regions 1, 2 and 3 (EC-1, EC-2 and EC-3), 
respectively). The transmembrane helices are also joined by strands of amino acids between 
transmembrane- 1 and transmembrane-2, transmembrane-3 and transmembrane-4, and 
transmembrane-5 and transmembrane-6 on the interior, or "intracellular" side, of the cell 
membrane (these are referred to as "intracellular" regions 1, 2 and 3 (IC-1, IC-2 and IC-3), 
1 0 respectively). The "carboxy" ("C") terminus of the receptor lies in the intracellular space within 
the cell, and the "amino" ("N") temiinus of the receptor lies in the extracellular space outside of 
the cell. The general structure of G protein-coupled receptors is depicted in Figure 1. 

Generally, when an endogenous ligand binds with the receptor (often referred to as 
"activation" of the receptor), there is a change in the conformation of the intracellular region that 
1 5 allows for coupling between the intracellular region and an intracellular "G-protein." Although 
other G proteins exist, currently, Gq. Gs, Gi, and Go are G proteins that have been identified. 
Endogenous ligand-activated GPOl coupling with the G-protein begins a signaling cascade 
process (referred to as "signal transduction"). Under normal conditions, signal transduction 
ultimately results in cellular activation or cellular inhibition. It is thought that the IC-3 loop as 
20 well as the carboxy terminus of the receptor interact with the G protein. A principal focus of this 
invention is directed to the transmcmbrane-6 (TM6) region and the intracellular-3 (IC3) region of 
theGPCR. 

Under physiological conditions, GPCRs exist in die cell membrane in equilibrium between 
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two different confonnations: an "inactive" state and an "active" state. As shown schematically in 
Figure 2, a receptor in an inactive state is unable to link to the intracellular signaling transduction 
pathway to produce a biological response. Changing the receptor conformation to the active state 
allows linkage to the transduction pathway (via the G-protein) and produces a biological response. 

A receptor may be stabilized in an active state by an endogenous iigand or a compound 
such as a drug. Recent discoveries, including but not exclusively limited to modifications to the 
amino acid sequence of the recq)tor, provide means other than endogenous ligands or drugs to 
promote and stabilize the receptor in the active state conforaiation. These means effectively 
stabilize die receptor in an active state by simulating the effect of an endogenous ligand binding 
to the receptor. Stabilization by such ligand-independent means is temied "constitutive receptor 
activation." 

As noted above, the use of an orphan receptor for screening purposes has not been 

possible. This is because the traditionar'dogma"regaidingscreeningofcompounds mandates that 
the ligand for the receptor be known. By definition, then, this approach has no applicability with 
respect to orphan receptors. Thus, by adhering to this dogmatic approach to die discovery of 
therapeutics, die art, in essence, has taught and has been taught to forsake the use of orphan 
receptors unless and until the endogenous ligand for the receptor is discovered. Given that there 
are an estimated 2,000 G protein coupled receptors, the majority of which are orphan receptors, 
such dogma castigates a creative, unique and distinct approach to the discovery of therapeutics. 

Information regarding die nucleic acid and/or amino acid sequences of a variety of GPCRs 
is summarized below in Table A. Because an important focus of the invention disclosed herein 
is directed towards orphan GPCRs, many of the below-cited references arc related to orphan 
GPCRs. However, this list is not intended to imply, nor is this list to be construed, legally or 
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Otherwise, that the invention disclosed herein is only ^plicable to orphan GPCRs or the specific 
GPCRs listed below. Additionally, certain receptors that have been isolated are not the subject of 
publications per se; for example, reference is made to a G Protein-Coupled Receptor database on 
the "world-wide web" (neither the named inventors nor the assignee have any aflBliation with this 
site) that Hsts GPCRs. Other GPCRs are the subject of patent applications owned by the present 
assignee and these are not listed below (including GPR3, GPR6 and GPRl 2; see U.S. Provisional 
Number 60/094879): 



Table A 



neccpior iName 


Publication Reference 


GPRl 


23 Genomics 609(1994) 


GPR4 


14 DNA and Cell Biology 25 (1995) 


GPR5 


14 DNA and Cell Biology 25 (1995) 


GPR7 


28 Genomics 84(1995). 


GPR8 


28 Genomics 84(1995) 


GPR9 


184 J. Exp. Med. 963(1996) 


GPRIO 


29 Genomics 335 (1995) 


GPR15 


32 Genomics 462 (1996) 


GPR17 


70 J Neurochem. 1357 (1998) 


GPRl 8 


42 Genomics 462(1997) 


GPR20 


187 Gene 75(1997) 


GPR21 


187 Gene 75(1997) 


GPR22 


187 Gene 75(1997) 


GPR24 


398 FEBS Lett. 253 (1996) 


GPR30 


45 Genomics 607(1997) 


GPR31 


42 Genomics 519 (1997) 


GPR32 


50 Genomics 281(1997) 


GPR40 


239 Biochem. Bipphys. 




Res. Commun. 543 (1997) 


GPR41 


239 Biochem. Biophys. 




Res. Commim. 543 (1997) 


GPR43 


239 Biochem. Biophys. 




Res. Commun. 543(1997) 


APJ 


136 Gene 355 (1993) 


BLRl 


22 Eur. J. Immunol. 2759 (1992) 


CEPR 


231 Biochem. Biophys. 




Res. Commun. 651 (1997) 


EBIl 


23 Genomics 643(1994) 


EBI2 


67 J. Virol. 2209(1993) 


ETBR-LP2 


424 FEBS Lett 193(1998) 


GPCR-CNS 


54 Brain Res, Mol. Brain Res. 152 (1998); 




45 Genomics 68(1997) 


GPR-NGA 


394 FEBS Lett 325 (1996) 


H9 


386 FEBS Lett 219 (1996) 
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HBA954 


1261 Biochim. Biophys. Acta 121 (1995) 




HG38 


247 Biochem. Biophys. 




Res. Commun. 266 (1998) 


HM74 


5 Int. Immunol 1239(1993) 


OGRl 


35 Genomics 397(1996) 


V28 


163 Gene 295(1995) 



As will be set forth and disclosed in greater detail below, utilization of a mutational cassette to 
modify the endogenous sequence of a human GPCR leads to a constituti vely activated version of 
the human GPCR. These non-endogenous, constitutively activated versions ofhuman GPCRs can 
be utilized, inter alia, for the screening of candidate compounds to directly identify compounds 
of, e.g., therapeutic relevance. 

SUMMARY OF THE INVENTION 

Disclosed herein is anon-endogenous, human G protein-coupled receptor comprising 
(a) as a most preferred amino acid sequence region (C-terminus to N-terminus orientation) 
and/or (b) as a most preferred nucleic acid sequence region (3' to 5' orientation) transversing 
the transmembrane-6 (TM6) and intracellular loop-3 (IC3) regions of the GPCR: 
(a) P» AAjsX 

wherein: 

(1) P' is an amino acid residue located within the TM6 region of 
the GPCR, where P^ is selected from the group consisting of (i) 
the endogenous GPCR's proline residue, and (ii) a non- 
endogenous amino acid residue other than proline; 

(2) AA,5 are 1 5 amino acids selected from the group consisting of 
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(a) the endogenous GPCR*s amino acids (b) non-endogenous 
amino acid residues, and (c) a combination of the endogenous 
GPCR's amino acids and non-endogenous amino acids, 
excepting that none of the 1 5 endogenous amino acid residues 
that are positioned within the TM6 region of the GPCR is 
proline; and 

X is a non-endogenous amino acid residue located within the 
IC3 region of said GPCR, preferably selected from the group 
consisting of lysine, hisitidine and arginine, and most 
preferably lysine, excepting that when the endogenous amino 
acid at position X is lysine, then X is an amino acid other than 
lysine, preferably alanine; 
and/or 

(b) P"^°"(AA-codon),5X^„ 

15 wherein: 

(1) P«^ is a nucleic acid sequence within the TM6 region of the 
GPOl, where P«*^ encodes an amino acid selected from the 
group consisting of (i) the endogenous GPCR's proline residue, 
and (ii) a non-endogenous amino acid residue other than proline; 

(2) (AA-codon),3 are 15 codons encoding 15 amino acids selected 
from the group consisting of (a) the endogenous GPCR's amino 
acids (b) non-endogenous amino acid residues and (c) a 
combination of the endogenous GPCR*s amino acids and non- 



(3) 



10 
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endogenous amino acids, excepting that none of the 15 



endogenous codons within the TM6 region of the GPCR encodes 



a proline amino acid residue; and 



(3) 



X^j^ is a nucleic acid encoding region residue located within the 



5 



IC3 region of said GPCR, where X^^dm encodes a non-endogenous 



amino acid, preferably selected from the group consisting of 



lysine, hisitidine and arginine, and most preferably lysine. 



excepting that when the endogenous encoding region at position 



Xcodon encodes the amino acid lysine, then Xcodon encodes an amino 



10 



acid other than lysine, preferably alanine. 



The terms endogenous and non-endogenous in reference to these sequence cassettes are relative 
to the endogenous GPCR. For example, once the endogenous proline residue is located within the 
TM6 region of a particular GPCR, and the 1 6* amino acid therefrom is identified for mutation to 
constitutively activate the receptor, it is also possible to mutate the endogenous proline residue 
15 (Le., once the marker is located and the 16* amino acid to be mutated is identified, one may mutate 
the marker itself), although it is most preferred that the proline residue not be mutated. Similarly, 
and while it is most preferred that AA15 be maintained in their endogenous forms, these amino 
acids may also be mutated. The only amino acid that must be mutated in the non-endogenous 
version of the human GPCR is X Le., the endogenous amino acid that is 16 residues from P' 
20 cannot be maintained in its endogenous fomi and must be mutated, as fiirther disclosed herein. 
Stated again, while it is preferred that in the non-endogenous version of the human GPCR, P^ and 
AA|5 remain in their endogenous fomis (Le,, identical to their wild-type foims), once X is 
identified and mutated, any and/or all of P' and AA,5 can be mutated. This ^phes to the nucleic 
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acid sequences as well. In those cases where the endogenous amino acid at position X is lysine, 
then in the non-endogenous version of such GPCR, X is an amino acid other than lysine, 
preferably alanine. 

Accordingly, and as a hypothetical example, if the endogenous GPCR has the following 
5 endogenous amino acid sequence at the above-noted positions: 

P- AACCTTGGRRRDDDE -Q 
then any of the following exemplary and hypothetical cassettes would fall within the scope of 
the disclosure (non-endogenous amino acids are set forth in bold): 

P-AACCTTGGRRRDDDE -K 
10 P-AACCTTHIGRRDDDE -K 

P-ADEETTGGRRRDDDE -A 
P-LLKFMSTWZLVAAPQ -K 
A-LLKFMSTWZLVAAPQ -K 
It is also possible to add amino acid residues within AA15, but such an approach is not particularly 
1 5 advanced. Indeed, in the most prefen:ed embodiments, the only amino acid that differs in the non- 
endogenous version of the human GPCR as compared with the endogenous version of that GPCR 
is the amino acid in position X; mutation of this amino acid itself leads to constitutive activation 
of the receptor. 

Thus, in particularly preferred embodiments, P^ and P**^ are endogenous proline and an 
20 endogenous nucleic acid encoding region encoding proline, respectively; and X and X^odon are non- 
endogenous lysine or alanine and a non-endogenous nucleic acid encoding region encoding lysine 
or alanine, respectively, with lysine being most preferred. Because it is most preferred that the 
non-endogenous versions of the human GPCRs which incoiporate these mutations are 
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incoiporated into mammalian cells and utilized for the screening of candidate compounds, the non- 
endogenous human GPCR incorporating the mutation need not be purified and isolated perse (/. e., 
these are incoiporated within the cellular membrane of a mammalian cell), although such purified 
and isolated non-endogenous human GPCRs are well within the purview of this disclosure. Gene- 
5 targeted and transgenic non-human mammals (preferably rats and mice) incorporating the non- 
endogenous human GPCRs are also within the pundew of this invention; in particular, gene- 
targeted mammals are most preferred in that these animals will incorporate the non-endogenous 
versions of the human GPCRs in place of the non-human mammal's endogenous GPCR-encoding 
region (techniques for generating such non-human mammals to replace the non-human mammal's 
10 protein encoding region with a human encoding region are well known; see, for example, U.S. 
Patent No. 5,777,194.) 

It has been discovered that these changes to an endogenous human GPCR render the 
GPCR constitutively active such that, as will be fiirther disclosed herein, the non-endogenous, 
constitutively activated version of the human GPCR can be utilized for, inter alia, the direct 
1 5 screening of candidate compounds without the need for the endogenous hgand. Thus, methods 
for using these materials, and products identified by these methods are also within the purview of 
the following disclosure. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a generalized structure of a G protein-coupled receptor with the numbers 
20 assigned to the transmembrane helixes, the intracellular loops, and the extracellular loops. 

Figure 2 schematically shows the two states, active and inactive, for a typical G 
protein coupled receptor and the linkage of the active state to the second messenger 
transduction pathway. 
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Figure 3 is a sequence diagram of the preferred vector pCMV, including restriction 
enzymen site locations. 

Figure4 is a diagrammatic representation of the signal measured comparingpCMV, non- 
endogenous, constitutively active GPR30 inhibition of GPR6-mediated activation of CRE-Luc 
5 reporter with endogenous GPR30 inhibition of GPR6-mediated activation of CRE-Luc 
reporter. 

Figure 5 is a diagrammatic representation of the signal measured comparing pCMV, non- 
endogenous, constitutively activated GPR17 inhibition of GPR3-mediated activation of CRE- 
Luc reporter viith endogenous GPR17 inhibition of GPR3-mediated activation of CRE-Luc 
10 reporter. 

Figure 6 provides diagrammatic results of the signal measured comparing control 
pCMV, endogenous APJ and non-endogenous APJ. 

Figure 7 provides an illustration of IP3 production from non-endogenous human 5- 
HTjA receptor as compared to the endogenous version of this receptor. 
15 Figure 8 are dot-blot forniat results for GPRl (8A), GPR30 (8B) and APJ (8C). 

DETAILED DESCRIPTION 

The scientific literature that has evolved around receptors has adopted a number of terms 
to refer to ligands having various effects on receptors. For clarity and consistency, the following 
definitions will be used throughout this patent document. To the extent that these definitions 
20 conflict with other definitions for these terms, the following definitions shall control: 

AGONISTS shall mean compounds that activate the intracellular response when they bind 
to the receptor, or enhance GTP binding to membranes. 



wo 00/22129 PCT/US99/23938 



-13- 

AMINO ACID ABBREVUTIONS used herein are set below. 



ALANINE 


ALA 


A 


ARGININE 


ARG 


R 


ASPARAGINE 


ASN 


N 


ASPARTTCACID 


ASP 


D 


CYSTEINE 


CYS 


C 


GLUTAMIC AOD 


GLU 


E 


GLUTAMINE 


GLN 


Q 


GLYCINE 


GLY 


G 


HISnDINE 


HIS 


H 


ISOLEUONE 


ILE 


I 


LEUCINE 


LEU 


L 


LYSINE 


LYS 


K 


METHIONINE 


MET 


M 


PHENYLALANINE 


PHE 


F 


PROLINE 


PRO 


P 


SERINE 


SER 


s • 


THREONINE 


THR 


T 


TOYPTOPHAN 


TRP 


W 


TYROSINE 


TYR 


Y 


VALINE 


VAL 


V 



PARTIAL AGONISTS shall mean cxjmpounds which activate the intracellular response 
when they bind to the receptor to a lesser degree/extent than do agonists, or enhance GTP binding 
to membranes to a lesser degree/extent than do agonists 

25 ANTAGONIST shall mean compounds that competitively bind to the receptor at the 

same site as the agonists but which do not activate the intracellular response initiated by the active 
form of the receptor, and can thereby inhibit the intracellular responses by agonists or partial 
agonists. ANTAGONISTS do not diminish the baseline intracellular response in the absence of 
an agonist or partial agonist. 

30 CANDIDATE COMPOUND shall mean a molecule (for example, and not limitation, 

a chemical compound) which is amenable to a screening technique. Preferably, the phrase 
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"candidate compound" does not include compounds which were publicly known to be compounds 
selected from the group consisting of inverse agonist, agonist or antagonist to a receptor, as 
previously detemiined by an indirect identification process ("indirectly identified compound"); 
more preferably, not including an indirectly identified compound which has previously been 
5 detemiined to have therapeutic eflScacy in at least one mammal; and, most preferably, not 
including an indirectly identified compound which has previously been detemiined to have 
therapeutic utility in humans. 

CODON shall mean a grouping of three nucleotides (or equivalents to nucleotides) which 
generally comprise a nucleoside (adenosine (A), guanosine (G), cytidine (C), uridine (U) and 
1 0 thymidine (T)) coupled to a phosphate group and which, when translated, encodes an amino acid. 

COMPOUND EFFICACY shall mean a measurement of the ability of a compound to 
inhibit or stimulate receptor fimctionality, as opposed to receptor binding affinity. A preferred 
means of detecting compound efficacy is via measurement of, eg., p^S]GTPyS binding, as further 
disclosed in the Example section of this patent document 
15 CONSTTTUnVELY ACTIVATED RECEPTOR shall mean a receptor subject to 

constitutive receptor activation. In accordance with the invention disclosed herein, a non- 
endogenous, human constitutively activated G protein-coupled receptor is one that has been 
mutated to include the amino acid cassette P'AAijX, as set forth in greater detail below. 

CONSTITUTIVE RECEPTOR ACTIVATION shall mean stabilization of a receptor 
20 in the active state by means other than binding of the receptor with its endogenous ligand or a 
chemical equivalent thereof Preferably, a G protein-coupled receptor subjected to constitutive 
receptor activation in accordance with the invention disclosed herein evidences at least a 10% 
dijBFerence in response (increase or decrease, as the case may be) to the signal measured for 
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constitutive activation as compared with the endogenous form of that GPCR, more preferably, 
about a 25% difference in such comparative response, and most preferably about a 50% difference 
in such comparative response. When used for the puiposes of directly identifying candidate 
compounds, it is most preferred that the signal difference be at least about 50% such that there is 
5 a sufficient difference between the endogenous signal and the non-endogenous signal to 
differentiate between selected candidate compounds. In most instances, the "difference" will be 
an increase in signal; however, with respect to Gs-coupled GPCRS. the "difference" measured is 
preferably a decrease, as will be set forth in greater detail below. 

CONTACT or CONTACTING shall mean bringing at least two moieties together, 
1 0 whether in an in vitro system or an in vivo system. 

DIRECTLY IDENTIFYING or DIRECTLY IDENTIFIED, in relationship to the 
phrase "candidate compound", shall mean the screening of a candidate compound against a 
constitutively activated G protein-coupled receptor, and assessing the compound efScacy of such 
compound. This phrase is, under no circumstances, to be interpreted or understood to be 
1 5 encompassed by or to encompass the phrase "indirectly identifying" or "indirectly identified." 

ENDOGENOUS shall mean a material that is naturally produced by the genome of the 
species. ENDOGENOUS in reference to, for example and not limitation, GPCR, shall mean that 
which is naturally produced by a human, an insect, a plant, a bacterium, or a virus. By contrast, 
the temi NON-ENDOGENOUS in this context shall mean that which is not naturally produced 
20 by the genome of a species. For example, and not limitation, a receptor which is not 
constitutively active in its endogenous fomi, but when mutated by using the cassettes disclosed 
herein and thereafter becomes constitutively active, is most preferably referred to herein as a "non- 
endogenous, constitutively activated receptor." Both temis can be utilized to describe both "in 
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vivo" and "in vitro" systems. For example, and not limitation, in a screening approach, the 
endogenous or non-endogenous receptor may be in reference to an in vitro screening system 
whereby the receptor is expressed on the cell-surface of a mammalian cell. As a fiirther example 
and not limitation, where the genome of a mammal has been manipulated to include a non- 
5 endogenous constitutively activated receptor, screening of a candidate compound by means of an 
in vivo system is viable. 

HOST CELL shall mean a ceU enable of having a Plasmid and/or Vector incorporated 
therein. In the case of a prokaryotic Host Cell, a Plasmid is typically replicated as an autonomous 
molecule as the Host Cell replicates (generally, the Plasmid is thereafter isolated for introduction 
10 into a eukaiyotic Host Cell); in the case of a eukaiyotic Host Cell, a Plasmid is integrated into the 
cellular DNA of the Host Cell such that when the eukaryotic Host Cell replicates, the Plasmid 
replicates. Preferably, for the purposes of the invention disclosed herein, the Host Cell is 
eukaryotic, more preferably, mammalian, and most preferably selected from the group consisting 
of 293, 293T and COS-7 cells. 
15 INDIRECIXYroE]mFYINGorINDIIffiCn.YroE 

approach to the drug discovery process involving identification of an endogenous ligand specific 
for an endogenous receptor, screening of candidate compounds against the receptor for 
determination of those which interfere and/or compete with the ligand-receptor interaction, and 
assessing the efiBcacy of the compound for affecting at least one second messenger pathway 
20 associated with the activated receptor. 

INfflBIT or INHIBrnNG, in relationship to the term "response" shall mean that a 
response is decreased or prevented in the presence of a compound as opposed to in the absence of 
the compound. 
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INVERSE AGONISTS shall mean compounds which bind to either the endogenous fonn 
of the receptor or to the constitutively activated form of the receptor, and which inhibit the 
baseline intracellular response initiated by the active form of the receptor below the normal base 
level of activity which is observed in the absence of agonists or partial agonists, or decrease GTP 
5 binding to membranes. Preferably, the baseline intracellular response is inhibited in the presence 
of the inverse agonist by at least 30%, more preferably by at least 50%, and most preferably by at 
least 75%, as compared with the baseline response in the absence of the inverse agonist. 

KNOWN RECEPTOR shall mean an endogenous receptor for which the endogenous 
ligand specific for that receptor has been identified. 
10 LIGAND shall mean an endogenous, naturally occurring molecule specific for an 

endogenous, naturally occurring receptor. 

MUTANT or MUTATION in reference to an endogenous receptor's nucleic acid and/or 
amino acid sequence shall mean a specified change or changes to such endogenous sequences such 
that a mutated form of an endogenous, non-constitutively activated receptor evidences constitutive 
15 activation of the receptor. In terms of equivalents to specific sequences, a subsequent mutated 
form of a human receptor is considered to be equivalent to a first mutation of the human receptor 
if (a) the level of constitutive activation of the subsequent mutated form of the receptor is 
substantially the same as that evidenced by the first mutation of the receptor; and (b) the percent 
sequence (amino acid and/or nucleic acid) homology between the subsequent mutated form of the 
20 receptor and the first mutation of the receptor is at least about 80%, more preferably at least about 
90% and most preferably at least 95%, Ideally, and owing to tiie fact tiiat the most preferred 
cassettes disclosed herein for achieving constitutive activation includes a single amino acid and/or 
codon change between the endogenous and the non-endogenous forms of the GPCR (i.e. X or 
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Xcodon)> the percent sequence homology should be at least 98%. 

ORPHAN RECEPTOR shall mean an endogenous receptor for which the endogenous 
ligand specific for that receptor has not been identified or is not known. 

PHARMACEUTICAL COMPOSITION shall mean a composition comprising at least 
5 one active ingredient, whereby the composition is amenable to investigation for a specified, 
efficacious outcome in a mammal (for example, and not limitation, a human). Those of ordinary 
skill in the art will understand and appreciate the techniques appropriate for determining whether 
an active ingredient has a desired efficacious outcome based upon the needs of the artisan. 

PLASMID shall mean the combination of a Vector and cDNA. Generally, a Plasmid is 
10 introduced into a Host Cell for the purpose of replication and/or expression of the cDNA as a 
protein. 

STIMULATE or STIMULATING, in relationship to the term "response" shall mean that 
a response is increased in the presence of a compound as opposed to in the absence of the 
compound. 

15 TRANSVERSE or TRANSVERSING, in reference to either a defined nucleic acid 

sequence or a defined amino acid sequence, shall mean that the sequence is located within at least 
two different and defined regions. For example, in an amino acid sequence that is 10 amino acid 
moieties in length, where 3 of the 1 0 moieties are in the TM6 region of a GPCR and the remaining 
7 moieties are in the IC3 region of the GPCR, the 10 amino acid moiety can be described as 

20 transversing the TM6 and ICS regions of the GPCR. 

VECTOR in reference to cDNA shall mean a circular DNA capable of incorporating at 
least one cDNA and enable of incorporation into a Host Cell. 

The order of the following sections is set forth for presentational efficiency and is not 
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intended, nor should be construed, as a limitation on the disclosure or the claims to follow. 
A. Introduction 

The traditional study of receptors has always proceeded fiom the a priori assumption 
(historically based) that the endogenous ligand must first be identified before discovery could 
5 proceed to find antagonists and other molecules that could affect the receptor. Even in cases 
where an antagonist might have been known first, the search immediately extended to looking for 
the endogenous ligand. This mode of thinking has persisted in receptor research even after the 
discovery of constitutively activated receptors. What has not been heretofore recognized is that 
it is the active state of the receptor that is most useful for discovering agonists, partial agonists, and 
1 0 inverse agonists of the receptor. For those diseases which result fi-om an overly active receptor or 
an under-active receptor, what is desired in a therapeutic dmg is a compound which acts to 
diminish the active state of a receptor or enhance the activity of the receptor, respectively, not 
necessarily a drug which is an antagonist to the endogenous ligand. This is because a compound 
that reduces or enhances the activity of the active receptor state need not bind at the same site as 
1 5 the endogenous ligand. Thus, as taught by a method of this invention, any search for therapeutic 
compounds should start by screening compounds against the ligand-independent active state. 

Screening candidate compounds against non-endogenous, constitutively activated GPCRs 
allows for the direct identification of candidate compounds which act at these cell surface 
receptors, without requiring any prior knowledge or use of the receptor's endogenous ligand. By 
20 determining areas within the body where the endogenous version of such GPCRs are expressed 
and/or over-expressed, it is possible to determine related disease/disorder states which are 
associated with the expression and/or over-expression of these receptors; such an approach is 
disclosed in this patent document. 
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B. Disease/Disorder Identification and/or Selection 

Most preferably, inverse agonists to the non-endogenous, constituti vely activated GPCRs 
can be identified using the materials of this invention. Such inverse agonists are ideal candidates 
as lead compounds in drug discoveiy programs for treating diseases related to these receptors. 
5 Because of the ability to direcdy identify inverse agonists, partial agonists or agonists to these 
receptors, thereby allowing for the development of phannaceutical compositions, a search, for 
diseases and disorders associated with these receptors is possible. For example, scanning both 
diseased and nomial tissue samples for the presence of these receptor now becomes more than an 
academic exercise or one which might be pursued along the path of identifying, in the case of an 
10 orphan receptor, an endogenous ligand. Tissue scans can be conducted across a broad range of 
healthy and diseased tissues. Such tissue scans provide a preferred first step in associating a 
specific receptor with a disease and/or disorder. 

Preferably, the DN A sequence of the endogenous GPCR is used to make a probe for either 
radiolabeled cDNA or RT-PCR identification of the expression of the GPCR in tissue samples, 
15 The presence of a receptor in a diseased tissue, or the presence of the receptor at elevated or 
decreased concentrations in diseased tissue compared to a nomial tissue, can be preferably utilized 
to identify a correlation with that disease. Receptors can equally well be localized to regions of 
organs by this technique. Based on the known fiinctions of the specific tissues to which the 
receptor is localized, the putative functional role of the receptor can be deduced. 

20 C. A "Human GPCR Prolme Marker" Algoritiim and the Creation of 
Non-Endogenous, Constitutively-Active Human GPCRs 

Among the many challenges facing the biotechnology arts is the unpredictability in 
gleaning genetic infomfiation fix»m one species and correlating that infomiation to another species 
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- nowhere in this art does this problem evidence more annoying exacerbation than in the gaietic 
sequences that encode nucleic acids and proteins. Thus, for consistency and because of the highly 
unpredictable nature of this art, the following invention is limited, in teraas of mammals, to human 
GPCRs - ^plicability of this invention to other mammalian species, while a potential possibility, 
5 is considered beyond mere rote application. 

In general, when attempting to apply common "rules" from one related protein sequence 
to another or from one species to another, the art has typically resorted to sequence alignment, /. 
sequences are linearized and attempts are then made to find regions of commonality between two 
or more sequences. While useful, this ^proach does not always prove to result in meaningful 
1 0 inforaiation. In the case of GPCRs, while the general structural motif is identical for all GPCRs, 
the variations in lengths of the TMs, ECs and ICs make such alignment approaches from one 
GPCR to another diflScult at best. Thus, while it may be desirable to apply a consistent approach 
to, e,g„ constitutive activation from one GPCR to another, because of the great diversity in 
sequence length, fidelity, etc from one GPCR to the next, a generally applicable, and readily 
15 successfiil mutational alignment ^proach is in essence not possible. In an analogy, such an 
approach is akin to having a traveler start a joumey at point A by giving the traveler dozens of 
different maps to point B, without any scale or distance markers on any of the maps, and then 
asking the traveler to find the shortest and most eflScient route to destination B only by using the 
maps. In such a situation, the task can be readily simplified by having (a) a common "place- 
20 marker" on each map, and (b) the ability to measure the distance from the place-marker to 
destination B - this, then, will allow the traveler to select the most efficient fiiom starting-point A 
to destination B. 

In essence, a feature of the invention is to provide such coordinates within human GPCRs 
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that readily allows for creation of a constitutively active fom of the human GPCRs. 

As those in the art appreciate, the transmembrane region of a cell is highly hydrophobic; 
thus, using standard hydrophobicity plotting techniques, those in the art are readily able to 
determine the TM regions of a GPCR, and specifically TM6 (this same approach is also 
5 ^plicable to detemiining the EC and IC regions of the GPCR). It has been discovered that within 
the TM6 region of human GPCRs, a common proline residue (generally near the middle of TM6), 
acts as a constitutive activation "marker." By counting 15 amino acids fi-om the prohne maiker, 
the 16* amino acid (which is located in the ICS loop), when mutated from its endogenous form 
to a non-endogenous fomi, leads to constitutive activation of the receptor. For convenience, we 
1 0 refer to this as the "Human GPCR Proline Marker" Algorithm. Although the non-endogenous 
amino acid at this position can be any of the amino acids, most preferably, the non-endogenous 
amino acid is lysine. While not wishing to be bound by any theory, we believe that this position 
itself is unique and that the mutation at this location impacts the receptor to allow for constitutive 
activation. 

1 5 We note that, for example, when the endogenous amino acid at the 1 6* position is already 

lysine (as is the case with GPR4 and GPR32), then in order for X to be a non-endogenous amino 
acid, it must be other than lysine; thus, in those situations where the endogenous GPCR has an 
endogenous lysine residue at the 16* position, the non-endogenous version of that GPCR 
preferably incorporates an amino acid other than lysine, preferably alanine, histidine and arginine, 

20 at this position. Of further note, it has been determined that GPR4 appears to be linked to Gs and 
active in its endogenous form (data not shown). 

Because there are only 20 naturally occurring amino acids (although the use of non- 
naturally occurring amino acids is also viable), selection of a particular non-endogenous amino 
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acid for substitution at this 16* position is viable and aUows for efficient selection of a non- 
endogenous amino acid that fits the needs of the investigator. However, as noted, the more 
preferred non-endogenous amino acids at the IG^ position are lysine, hisitidine, arginine and 
alanine, with lysine being most preferred. Those of ordinary skill in the art are credited with the 
ability to readily determine proficient methods for changing the sequence of a codon to achieve 
a desired mutation. 

It has also been discovered that occasionally, but not always, the proline residue marker 
will be preceded in TM6 by W2 (/.e, W2P^AA,5X) where W is tryptophan and 2 is any amino 
acid residue. 

Our discovery, amongst other things, negates the need for unpredictable and complicated 
sequence alignment approaches commonly used by the art. Indeed, the strength of our discovery, 
while an algorithm in nature, is that it can be ^plied in a facile manner to human GPCRs, with 
dexterous simplicity by those in the art, to achieve a unique and highly useful end-product, z.e., a 
constitutively activated version of a human GPCR. Because many years and significant amounts 
of money will be required to determine the endogenous ligands for the human GPCRs that the 
Human Genome proj ect is uncovering, the disclosed invention not only r^uces the time necessary 
to positively exploit this sequence information, but at significant cost-savings. This approach tnrly 
validates the importance of the Human Genome Project because it allows for the utilization of 
genetic information to not only understand the role of the G?CRs in, e.g., diseases, but also 
provides the opportunity to improve the human condition. 
D, Screening of Candidate Compounds 

1. Generic GPCR screening assay techniques 

When a G protein receptor becomes constitutively active, it couples to a G protein (eg.. 
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Gq, Gs, Gi, Go) and stimulates release and subsequent binding of GTP to the G protein. The G 
protein then acts as a GTPase and slowly hydrolyzes the GTP to GDP, whereby the receptor, 
under normal conditions, becomes deactivated. However, constitutively activated receptors, 
including the non-endogenous, human constitutively active GPCRs of the present invention, 
5 continue to exchange GDP for GTP. A non-hydrolyzable analog of GTP, pS]GTPyS, can be 
used to monitor enhanced binding to G proteins present on membranes which express 
constitutively activated receptors. It is reported that pS]GTPyS can be used to monitor G protein 
coupling to membranes in the absence and presence of ligand. An example of this monitoring, 
among other examples well-known and available to those in the art, was reported by Traynor and 
10 Nahorski in 1995. The preferred use of this assay system is for initial screening of candidate 
compounds because the system is generically applicable to all G protein-coupled receptors 
regardless of the particular G protein that interacts with the intracellular domain of the receptor. 

B 2. Specific GPCR screening assay techniques 

C Once candidate compounds are identified using the "generic" G protein- 
15 coupled receptor assay (Le., an assay to select compounds that are agonists, partial 
agonists, or inverse agonists), further screening to confirm that the compounds have 
interacted at the receptor site is prefenred. For example, a compound identified by the 
"generic" assay may not bind to the receptor, but may instead merely "uncouple" the G 
protein fix)m the intracellular domain. 

20 a. GsandGL 

Gs stimulates the enzyme adenylyl cyclase. Gi (and Go), on the other hand, 
inhibit this enzyme. Adenylyl cyclase catalyzes the conversion of ATP to cAMP; thus. 



wo 00/22129 PCT/US99/23938 



-25- 

constitutively activated GPCRs that couple the Gs protein are associated with increased 
cellular levels of cAMP. On the other hand, constitutively activated GPCRs that couple the 
Gi (or Go) protein are associated with decreased cellular levels of cAMP. See, generally, 
"Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3^** Ed.) 
5 Nichols, J.G. et al eds. Sinauer Associates, Inc. (1992). Thus, assays that detect cAMP can 
be utilized to determine if a candidate compound is, e.g., an inverse agonist to the receptor 
{i.e., such a compound would decrease the levels of cAMP). A variety of approaches known 
in the art for measuring cAMP can be utilized; a most preferred approach relies upon the use 
of anti-cAMP antibodies in an ELISA-based format. Another type of assay that can be 
1 0 utilized is a whole cell second messenger reporter system assay. Promoters on genes drive the 
expression of the proteins that a particular gene encodes. Cyclic AMP drives gene expression by 
promoting the binding of a cAMP-responsi ve DNA binding protein or transcription factor (CREB) 
which then binds to the promoter at specific sites called cAMP response elements and drives the 
expression of the gene. Reporter systems can be constructed which have a promoter containing 
15 multiple cAMP response elements before the reporter gene, e.g., P-galactosidase or luciferase. 
Thus, a constitutively activated Gs-linked receptor causes the accumulation of cAMP that then 
activates the gene and expression of the reporter protein. The r^orter protein such as p- 
galactosidase or luciferase can then be detected using standard biochemical assays (Chen et al. 
1995). With respect to GPCRs that link to Gi (or Go), and thus decrease levels of cAMP, an 
20 approach to the screening of, e.g., inverse agonists, based upon utilization of receptors that link to 
Gs (and thus increase levels of cAMP) is disclosed in the Example section with respect to GPRl 7 
andGPR30. 
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b. Go and Gq, 

Gq and Go are associated with activation of the enzyme phosphohpase C, 
which in turn hydrolyzes the phospholipid PIP2, releasing two intracellular messengers: 
diacycloglycerol (DAG) and inistol 1,4,5-triphoisphate (IP3). Increased accumulation of IP3 
5 is associated with activation of Gq- and Go-associated receptors. See, generally, "Indirect 
Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3"* Ed.) Nichols, 
J.G. et al eds. Sinauer Associates, Inc. (1992). Assays that detect ffj accumulation can be 
utilized to determine if a candidate compound is, e,g., an inverse agonist to a Gq- or Go- 
associated receptor {i.e,, such a compound would decrease the levels of IP3). Gq-associated 

10 receptors can also been examined using an API reporter assay in that Gq-dependent 
phosphohpase C causes activation of genes containing API elements; thus, activated Gq- 
associated receptors will evidence an increase in the expression of such genes, whereby 
inverse agonists thereto will evidence a decrease in such expression, and agonists will 
evidence an increase in such expression. Commercially available assays for such detection 

15 are available. 

E. Medicinal Chemistry 

Generally, but not always, direct identification of candidate compounds is preferably 
conducted in conjunction with compounds generated via combinatorial chemistry techniques, 
whereby thousands of compounds are randomly prepared for such analysis. Generally, the 
20 results of such screening will be compoimds having unique core structures; thereafter, these 
compounds are preferably subjected to additional chemical modification around a preferred 
core structure(s) to further enhance the medicinal properties thereof Such techniques are 
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known to those in the art and will not be addressed in detail in this patent document. 

F. Pharmaceutical Compositions 

Candidate compounds selected for further development can be fomiulated into 
phamiaceutical compositions using techniques well known to those in the art Suitable 
5 pharmaceutically-acceptable carriers are available to those in the art; for example, see Remington 's 
Pharmaceutical Sciences, 16* Edition, 1980, Mack Publishing Co., (Oslo et al., eds.) 

G. Other Utility 

Although a preferred use of the non-endogenous versions of the disclosed human GPCRs 
is for the direct identification of candidate compounds as inverse agonists, agonists or partial 

10 agonists (preferably for use as pharmaceutical agents), these receptors can also be utilized in 
research settings. For example, in vitro and in vivo systems incorporating these receptore can be 
utilized to further elucidate and understand the roles of the receptors in the human condition, both 
normal and diseased, as well understanding the role of constitutive activation as it applies to 
understanding the signaling cascade. A value in these non-endogenous receptors is that their 

15 utility as a research tool is enhanced in that, because of their unique features, the disclosed 
receptors can be used to understand the role of a particular receptor in the human body before the 
endogenous ligand therefor is identified. Other uses of the disclosed receptors will become 
^parent to those in the art based upon, inter alia, a review of this patent document. 

EXAMPLES 

20 The following examples are presented for purposes of elucidation, and not limitation, 

of the present invention. Following tiie teaching of tiiis patent document that a mutational 
cassette may be utilized in the IC3 loop of human GPCRs based upon a position relative to 
a proline residue in TM6 to constitutively activate the receptor, and while specific nucleic acid 
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and amino acid sequences are disclosed herein, those of ordinary skill in the art are credited 

with the ability to make minor modifications to these sequences while achieving the same or 

substantially similar results reported below. Particular approaches to sequence mutations are 

within the purview of the artisan based upon the particular needs of the artisan. 

5 Example 1 

Preparation of Endogenous Human GPCRs 

A variety of GPCRs were utilized in the Examples to follow. Some endogenous human 

GPCRs were graciously provided in expression vectors (as acknowledged below) and other 

endogenous human GPCRs were synthesized de novo using publicly-available sequence 

10 infomiation. 

1. GPRl (GenBank Accession Number: U13666) 

The human cDNA sequence for GPRl was provided in pRcCMV by Brian 
O'Dowd (University ofToronto). GPRl cDNA (1 .4kB fiagment) was excised from thepRcCMV 
vector as a Ndel-Xbal fragment and was subcloned into the Ndel-Xbal site of pCMV vector {see 
1 5 Figure 3). Nucleic acid (SEQ.ID.NO.: 1) and amino acid (SEQ.ID.NO.: 2) sequences for human 
GPRl were thereafter determined and verified 

2. GPR4 (GenBank Accession Numbers: L36148, U35399, U21051) 
The human cDNA sequence for GPR4 was provided in pRcCMV by Brian 

O'Dowd (University ofToronto). GPRl cDNA (1 .4kB fiagment) was excised fcom the pRcCMV 
20 vector as an Apal(blunted)-Xbal fiagment and was subcloned (with most of the 5' untranslated 
region removed) into HindIII(blunted)-XbaI site of pCMV vector. Nucleic acid (SEQ.ID.NO. : 3) 
and amino acid (SEQ.ID.NO.: 4) sequences for human GPR4 were thereafter determined and 
verified. 

3. GPRS (GenBank Accession Number: L36149) 
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The cDNA for human GPRS was generated and cloned into pCMV expression 
vector as follows: PGR was perfonned using genomic DNA as template and rTth polymerase 
(Perldn Elmer) with the buffer system provided by the manufacturer, 0.25 |iM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94**C for 1 min; 64 °C 
5 for Imin; and 72 for 1 .5 min. The 5' PGR primer contained an EcoRI site with the sequence: 
5'-TATGAATTCAGATGCTCTAAACGTCCCTGC-3' (SEQ.ID.NO.: 5) 
and the 3* primer contained BamHI site with the sequence: 
5'-TCCGGATCCACCTGCACCTGCGCCTGCACC-3' (SEQ.ID.NO.: 6). 
The 1 . 1 kb PGR fragment was digested with EcofU and BamHI and cloned into EcoRI-BamHI 
1 0 site of PCMV expression vector. Nucleic acid (SEQ.ID.NO.: 7) and amino acid (SEQ.ID.NO.: 
8) sequences for human GPRS were thereafter determined and verified. 

4. GPR7 (GenBank Accession Number: U22491) 
The cDNA for human GPR7 was generated and cloned into pGMV expression 
vector as follows: PGR condition- PGR was perfonmed using genomic DNA as template and rTth 
1 5 polymerase (Peikin Elmer) with the buffer system provided by the manufacturer, 0.2S ^iM of each 
primer, and 0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°G for 
1 min; 62°G for Imin; and 72 °C for Imin and 20 sec. The S ' PGR primer contained a HindlU site 
with the sequence: 

5'-GGAAGGTTGGGGGAGGGGAGGTGGGGGGGT-3' (SEQ.ID.NO.: 9) 
20 and the 3 ' primer contained a BamHI site with the sequence: 

5'-GGGGATGGGGAGGGTGGGGGAGTGAGGGTGG-3' (SEQ.ID.NO.: 10). 

The 1 . 1 kb PGR fragment was digested with HindEI and BamHI and cloned into HindlH-BainHI 

site of pGMV expression vector. Nucleic acid (SEQ.ID.NO.: 1 1) and amino acid (SEQ.ID.NO.: 
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li) sequences for human GPR7 were thereafter determined and verified. 

5. GPRS (GenBank Accession Number: U22492) 

The cDNA for human GPRS was generated and cloned into pCMV expression 
vector as follows: PCR was performed using genomic DNA as template and rTth polymerase 
5 (Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 \xM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94**C for 1 min; 62**C 
for Imin; and 72 for Imin and 20 sec. The 5' PCR primer contained an EcoRI site with die 
sequence: 

5'-CGGAATTCGTCAACGGTCCCAGCTACAATG-3' (SEQ.ID.NO.: 13). 

10 and the 3 ' primer contained a BamHI site with the sequence: 

5'-ATGGATCCCAGGCCCTTCAGCACCGeAATAT-3'(SEQ.ID.NO.: 14). 
The 1.1 kb PCR fiagment was digested with EcoRI and BamHI and cloned into EcoRI-BamPII 
site of PCMV expression vector. All 4 cDNA clones sequenced contained a possible 
polymorphism involving a change of amino acid 206 fix)m Arg to Gin. Aside fi-om this 

15 difference,nucleicacid(SEQ.ID.NO.: 15)andaminoacid(SEQ.ID.NO.: 16) sequences forhuman 
GPRS were thereafter determined and verified. 

6. GPR9 (GenBank Accession Number: X9S876) 

The cDNA for human GPR9 was generated and cloned into pCMV expression 
vector as follows: PCR was perfomied using a clone (provided by Brian O'Dowd) as template and 
20 pfii polymerase (Stratagene) with the buffer system provided by the manufacturer supplemented 
with 10% DMSO, 0.25 yM of each primer, and 0.5 mM of each of the 4 nucleotides. The cycle 
condition was 25 cycles of: 94°C for 1 min; 56**C for Imin; and 72*»C for 2.5 min. The 5' PCR 
primer contained an EcoRI site with the sequence: 
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5 '-ACGAATTCAGCCATGGTCCTTGAGGTGAGTGACCACC AAGTGCTAAAT-3 ' 
(SEQ.ID.no.: 17) 

and the 3' primer contained a BamHI site with the sequence: 
5'-GAGGATCCTGGAATGCGGGGAAGTCAG-3' (SEQ.ID.NO.: 1 8). 
5 The 1.2 kb PGR fragment was digested with EcoRI and cloned into EcoRI-Smal site of PCMV 
expression vector. Nucleic acid (SEQ.ID.NO.: 19) and amino acid (SEQ.ID.NO.: 20) sequences 
for human GPR9 were thereafter detennined and verified. 

7. GPR9-6 (GenBank Accession Number: U45982) 
The cDNA for human GPR9-6 was generated and cloned into pCMV expression 
10 vector as follows: PGR was performed using genomic DNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 62*^0 
for Imin; and 72 for 1 min and 20 sec. The 5' PGR primer was kinased with the sequence: 
5'-TTAAGCTTGACCTAATGCCATCTTGTGTCC0' (SEQ.ID.NO.: 21) 
15 and the 3 ' primer contained a BamHI site with the sequence: 

5'-TTGGATCCAAAAGAACCATGCACCTCAGAG-3* (SEQ.ID.NO.: 22). 
The 1.2 kb PGR firagment was digested with BamHI and cloned into EcoRV-BamHI site of 
pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 23) and amino acid (SEQ.ID.NO.: 24) 
sequences for human GPR9-6 were thereafter detemiined and verified. 
20 8. GPRIO (GenBank Accession Number: U32672) 

The human cDNA sequence for GPRIO was provided in pRcCMV by Brian 
O'Dowd (University of Toronto). GPRIO cDNA (1.3kB fragment) was excised bom the 
pRcCMV vector as an EcoRI-Xbal fragment and was subcloned into EcoRI-Xbal site of pCMV 
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vector. Nucleic acid (SEQ.E).NO.: 25) and amino acid (SEQ.ID.NO.: 26) sequences for human 
GPRIO were thereafter detemiined and verified. 

9. GPR15 (GenBank Accession Number: U34806) 

The human cDNA sequence for GPR15 was provided in pCDNA3 by Brian 
O'Dowd (University of Toronto). GPR15 cDNA (1.5kB fragment) was excised finm the 
pCDNA3 vector as a Hindlll-Bam fragment and was subcloned into HindOI-Bam site of pCMV 
vector. Nucleic acid {SEQ.ID.NO.: 27) and amino acid (SEQ.ID.NO.: 28) sequences for human 
GPRl 5 were thereafter detennined and verified. . 

10. GPR17 (GenBank Accession Number: Z94154) 

The cDNA for human GPRl 7 was generated and cloned into pCMV expression 
vector as follows: PGR was perfomied using genomic DNA as template and iTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
0.2 mM of each 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 56**C for 
Imin and 72 **C for 1 min and 20 sec. The 5' PGR primer contained an EcoRI site with the 
sequence: 

5'-CTAGAATTCTGACTCCAGCCAAAGCATGAAT-3' (SEQ.ID.NO.: 29)andthe3' primer 
contained a BamHI site with the sequence: 

5'-GCTGGATCCTAAACAGTCTGCGCTCGGCCT-3' (SEQ.ID.NO.: 30). 
The 1 . 1 kb PGR fragment was digested with EcoRI and BamHI and cloned into EcoRI-BamHI 
site of pCbAV expression vector. Nucleic acid (SEQ.ID.NO.: 31) and amino acid (SEQ.ID.NO.: 
32) sequences for human GPRl 7 were thereafter detem^ied and verified. 

11. GPR18 (GenBank Accession Number: L42324) 

The cDNA for human GPRl 8 was generated and cloned into pCMV expression 
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vector as follows: PCR was perfonned using genomic DNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 nM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 54°C 
for Imin; and 72 for Imin and 20 sec. The 5' PCR primer was kinased with the sequence: 
5 5'-ATAAGATGATCACCCTGAACAATCAAGAT -3' (SEQ.ID.NO.: 33) 
and the 3' primer contained an EcoRI site with the sequence: 
5'-TCCGAATTCATAACATTTCACTGTTTATATTGC-3' (SEQ.ID.NO.: 34). 
The 1.0 kb PCR fragment was digested with EcoRI and cloned into blunt-EcoRI site of pCMV 
expression vector. All 8 cDN A clones sequenced contained 4 possible polymorphisms involving 
10 changes of amino acid 12 from Thr to Pro, amino acid 86 fi^om Ala to Glu, amino acid 97 bom 
He to Leu and amino acid 310 from Leu to Met. Aside from these changes, nucleic acid 
(SEQ.ID.no. : 35) and amino acid (SEQ.ID.NO. : 36) sequences for human GPRl 8 were thereafter 
determined and verified. 

12. GPR20 (GenBank Accession Number: U66579) 
1 5 The cDNA for human GPR20 was generated and cloned into pCMV expression 

vector as follows: PCR was performed using genomic DNA as template and rTth polymerase 
(Perkin Ebner) with the buffer system provided by the manufacturer, 0.25 fiM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94**C for 1 min; 62°C 
for Imin; and 72 °C for 1 min and 20 sec. The 5 ' PCR primer was kinased with the sequence: 
20 5'.CCAAGCTTCCAGGCCTGGGGTGTGCTGG-3' (SEQ.ID.NO.: 37) 
and the 3' primer contained a BamHI site with the sequence: 
5'-ATGGATCCTGACCTTCGGCCCCTGGCAGA-3' (SEQ.ID.NO.: 38). 
The 1.2 kb PCR fragment was digested with BamHI and cloned into EcoRV-BamHI site of 
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PCMV expression vector. Nucleic acid (SEQ.ID.NO.: 39) and amino acid (SEQ.ID.NO.: 40) 
sequences for human GPR20 were thereafter determined and verified. 

13. GPR21 (GenBank Accession Number: U66S80) 
The cDNA for human GPR21 was generated and cloned into pCMV expression 
5 vector as follows: PGR was performed using genomic DNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by die manufacturer, 0.25 nM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for I min; 62**C 
for Imin; and 72 *'C for 1 min and 20 sec. The 5 ' PGR primer was kinased with the sequence: 
5*-GAGAATTGACrGGTGAGGTGAAGATGAAGT-3' (SEQ.ID.NO.: 41) 

10 and the 3 ' primer contained a BamHI site with the sequence: 

5'-GGGGATGGGCGTAACTGAGGGAGTTCAGAT-3' (SEQ.E).NO.: 42). 
The 1.1 kb PGR fragment was digested with BamHI and cloned into EcoRV-BamHI site of 
pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 43) and amino acid (SEQ.ID.NO.: 44) 
sequences for human GPR21 were thereafter detennined and verified. 

1 5 14, GFR22 (GenBank Accession Number: U66581) 

The cDNA for human GPR22 was generated and cloned into pGMV expression 
vector as follows: PGR was performed using genomic DNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 \xM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94**G for I min; 50**G 

20 for Imin; and 72 ^'G for 1.5 min. The 5' PGR primer was kinased with the sequence: 
5'-TCGGGGGGGAAAAAAAGGAAGTGGTGGAAA-3' (SEQ.ID.NO.: 45) 
and the 3' primer contained a BamHI site with the sequence: 
5'-TAGGATGGATTrGAATGTGGATTTGGTGAAA-3* (SEQ.ID.NO.: 46). 
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The 1.38 kb PGR fragment was digested with BamHI and cloned into EcoRV-BamHI site of 
pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 47) and amino acid (SEQ.ID.NO.: 48) 
sequences for human GPR22 were thereafter detemiined and verified. 

15. GPR24 (GenBank Accession Number: U71092) 

The cDNA for human GPR24 was generated and cloned into pCMV expression 
vector as follows: PGR was performed using genomic DNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
0.2 mM of each 4 nucleotides. The cycle condition was 30 cycles of: 94*'G for 1 min; 56°C for 
Imin; and 72 °G for 1 min and 20 sec. The 5' PGR primer contains a Hindm site with the 
sequence: 

5*-GTGAAGGTTGGGTGTGGTGGCTGGAGGAGG-3' (SEQ.ID.NO.: 49) 

and the 3' primer contains an EcoRI site with the sequence: 

5'-GGAGAATTGGGGGTGGGGTGTTGTGGTGGGG-3' (SEQ.ID.NO.: 50). 

The 1 .3 kb PGR fragment was digested with HindlD and EcoRI and cloned into HindDI-EcoRI 

site of pGMV expression vector. The nucleic acid (SEQ.ID.NO.: 51) and amino acid sequence 

(SEQ.ID.no.: 52) for human GPR24 were thereafter detenmined and verified. 

16. GPR30 (GenBank Accession Number: U63917) 

The cDNA for human GPR30 was generated and cloned as follows: the coding 
sequence of GPR30 (1 128bp in length) was amplified from genomic DNA using the primers: 
5'-GGGGGATGGATGGATGTGAGTTGGGAA.3' (SEQ.ID.no.: 53) and 
5'-GGGGGATGGCTAGAGGGGAGTGCTGAA-3' (SEQ.ID.NO.: 54). 
The amplified product was then cloned into a commercially available vector, pCR2 . 1 (Invitiogen), 
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using a "TOPO-TA Cloning Kit" (Invitrogen, #K4500-01), following manufacturer instructions. 
The full-length GPR30 insert was liberated by digestion with BamHl, separated fiom the vector 
by agarose gel electrophoresis, and purified using a Sephaglas Bandprep^" Kit (Pharmacia, # 27- 
9285-01) foUowingmanufacturer instructions. Thenucleic acid (SEQ.ID.NO.: 55) and amino acid 
sequence (SEQ.ID.NO.: 56) for human GPR30 were thereafter detennined and verified. 
17. GPR31 (GenBank Accession Number; U65402) 
The cDNA for human GPR3 1 was generated and cloned into pCMV expression 
vector as follows: PGR was perfomied using genomic DNA as template and rTth polymerase 
(Peridn Elmer) v/ith the buffer system provided by the manufacturer, 0.25 |iM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 58*'C 
for Imin; and 72 °C for 2 min. The 5' PGR primer contained an EcoRI site with the sequence: 
5'-AAGGAATTCAGGGCGGGGTGATGCGATTCCC-3' (SEQ.ID.NO.: 57) 
and the 3' primer contained a BamHI site with the sequence: 
5'-GGTGGATCCATAAACACGGGCGTTGAGGAG-3' (SEQ.ID.NO.: 58). 
The 1 .0 kb PGR fiagment was digested with EcoRI and BamHI and cloned into EcoRI-BamHI 
site of pGMV expression vector. Nucleic acid (SEQ.ID.NO.: 59) and amino acid (SEQ.ID.NO.: 
60) sequences for human GPR3 1 were thereafter detemiined and verified 
18. GPR32 (GenBank Accession Number: AF045764) 

The cDNA for human GPR32 was generated and cloned into pGMV expression 
vector as follows: PGR was perfomied using genomic DNA as template and rTth polymerase 
(Pcrkin Etaier) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
0.2 mM of each 4 nucleotides. The cycle condition was 30 cycles of: 94''G for 1 min; 56*^0 for 
Imin; and 72 **G for 1 min and 20 sec. The 5' PGR primer contained an EcoRI site with the 
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sequence: 

5'-TAAGAATTCCATAAAAATTATGGAATGG-3'(SEQ.ID.NO.:243) 
and the 3' primer contained a BamHI site with the sequence: 
5'-CCAGGATCCAGCTGAAGTCTTCCATCATTC-3' (SEQ.ID.NO.: 244). 
5 The 1 . 1 kb PCR fragment was digested with EcoRI and BamHI and cloned into EcoRI-BamHI 
site of pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 245) and amino acid (SEQ.ID.NO.: 
246) sequences for human GPR32 were thereafter determined and verified 

19. GPR40 (GenBank Accession Number: AF024687) 

The cDNA for human GPR40 was generated and cloned into pCMV expression 
10 vector as follows: PCR was perfomied using genomic DNA as template and rTth polymerase 
(Peridn Elmer) with the buffer system provided by the manufacturer, 0.25 \xM of each primer, and 
0.2 mM of each 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min, 65°C for 
Imin and 72 °C for 1 min and 10 sec. The 5' PCR primer contained an EcoRI site with the 
sequence 

15 5'-GCAGAATTCGGCGGCCCCATGGACCTGCCCCC-3* (SEQ.ID.NO.: 247) 
and the 3' primer contained a BamHI site with the sequence 
5'-GCTGGATCCCCCGAGCAGTGGCGTTACTTC-3' (SEQ.ID.NO.: 248). 
The 1 kb PCR fragment was digested with EcoRI and BamHI and cloned into EcoRI-BamHI site 
ofpCMV expression vector. Nucleic acid (SEQ.ID.NO.: 249) and amino acid (SEQ.ID.NO.: 250) 

20 sequences for human GPR40 were thereafter detemiined and verified. 

20. GPR41 (GenBank Accession Number AF024688) 

The cDNA for human GPR41 was generated and cloned into pCMV expression 
vector as follows: PCR was performed using genomic DNA as template and rTth polymerase 
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(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
0.2 mM of each 4 nucleotides. The cycle condition was 30 cycles of 94**C for 1 min, 65°C for 
Imin and 72 °C for 1 min and 10 sec. The 5' PCR primer contained an Hindm site with the 
sequence: 

5 S^CTCAAGOTACTCTCTCTCACCAGTGGCCACO' (SEQ,ID.NO.: 251) 
and the 3' primer was kinased with the sequence 
5'-CCCTCCTCCCCCGGAGGACCTAGC-3* (SEQ.ID.NO.: 252). 

The 1 kb PCR fragment was digested with Hindm and cloned into Hindffl-blunt site of pCMV 
expression vector. Nucleic acid (SEQ.ID.NO.: 253) and amino acid (SEQ.ID.NO.: 254) 
10 sequences for human GPR41 were thereafter detemuned and verified. 

21. GPR43 (GenBank Accession Number AF024690) 
The cDNA for human GPR43 was generated and cloned into pCMV expr^sion 
vector as follows: PCR was performed using genomic DNA as template and rTth polymerase 
(PerkinElmer) with the buffer system provided by the manufacturer, 025 of eachpr^ and 
15 0.2 mM of each 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 65°C for 
Imin; and 72 °C for 1 min and 10 sec. The 5' PCR primer contains an HindlH site with the 
sequence: 

5'-TTTAAGCTrCCCCTCCAGGATGCTGCCGGAC-3' (SEQ.ID.NO.: 255) 
and the 3 ' primer contained an EcoRI site with the sequence: 
20 5'.GGCGAATTCTGAAGGTCCAGGGAAACTGCTA-3' (SEQ.ID.NO. 256). 

The 1 kb PCR fragment was digested with Hindm and EcoRI and cloned into Hindm-EcoRI site 
of pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 257) and amino acid {SEQ.ID.NO.: 258) 
sequences for human GPR43 were thereafter deteraiined and verified. 
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22. APJ (GeoBank Accession Number: U03642) 

Human APJ cDNA (in pRcCMV vector) was provided by Brian O'Dowd 
(University of Toronto). The human APJ cDNA was excised &om the pRcCMV vector as an 
EcoRI-Xbal (blunted) fragment and was subcloned into EcoRI-Smal site of pCMV vector. 
Nucleic acid (SEQ.ID.NO.: 61) and amino acid (SEQ.ID.NO,: 62) sequences for human APJ 
were thereafter determined and verified. 

23. BLRl (GenBank Accession Number: X68149) 

The cDNA for human BLRl was generated and cloned into pCMV expression 
vector as follows: PGR was perforaied using thymus cDNA as template and rTth polymerase 
(Perkin Ehner) with the buffer system provided by the manufacturer, 0.25 ^iM of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 62°C 
for Imin; and 72**C for 1 min and 20 sec. The 5' PGR primer contained an EcoRI site with the 
sequence: 

5'-TGAGAATTGTGGTGAGTGAGAGGCGGGAGAG-3' (SEQ.ID.NO.: 63): 
and the 3' primer contained a BamHI site with the sequence: 

5'-GCCGGATCCAAGGAAAAGCAGCAATAAAAGG-3'(SEQ.ID,NO.: 64). The 1.2 kb PGR 
fragment was digested v/ith EcoRI and BamHI and cloned into EcoRI-BamHI site of pCMV 
expression vector. Nucleic acid (SEQ.ID.NO.: 65) and amino acid (SEQ.ID.NO.: 66) sequences 
for human BLRl were thereafter determined and verified. 

24. CEPR (GenBank Accession Number: U77827) 

The cDNA for human CEPR was generated and cloned into pCMV expression 
vector as follows: PGR was perfomied using genomic DNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
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0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 65°C 
for Imin; and 72 for 1 min and 20 sec. The 5' PGR primer was kinased with the sequence: 
5'-CAAAGCTTGAAAGCrGCACGGTGCAGAGAC-3' (SEQ.ID,Na:67) 
and the 3' primer contained a BamHI site with the sequence: 
5 5'-GCGGATCCCGAGTCACACCCTGGCTGGGCC-3' (SEQ.ID.NO.: 68). 

The 1.2 kb PGR fragment was digested with BamHI and cloned into EcoRV-BamHI site of 
pGMV expression vector. Nucleic acid (SEQ.ro.NO.: 69) and amino acid (SEQ.ID.NO.: 70) 
sequences for human GEPR were thereafter determined and verified. 

25. EBIl (GenBank Accession Number: L31581) 
10 The cDNA for human EBIl was generated and cloned into pGMV expression 

vector as follows: PGR was perfomied using thymus cDNA as template and rTth polymerase 
(Perkin Ehner) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94'*G for 1 min; 62°G 
for Imin; and 72 '^G for 1 min and 20 sec. The 5' PGR primer contained an EcoRI site with the 
15 sequence: 

5'-AGAGAATTGGTGTGTGGTTTTAGGGGGGAG-3' (SEQ.ID.NO.: 71) 
and the 3' primer contained a BamHI site with the sequence: 
5'-GTCGGATGGAGGGAGAAGAGTGGGGTATGG-3' (SEQ.ID.NO.: 72). 
The 1.2 kb PGR fragment was digested with EcoRI and BamHI and cloned into EcoRI-BamK 
20 site of PGMV expression vector. Nucleic acid (SEQ.ID.NO.: 73) and amino acid (SEQ.ID.NO.: 
74) sequences for human EBIl were thereafter determined and verified. 

26. EBI2 (GenBank Accession Number: L08177) 

The cDNA for human EBI2 was generated and cloned into pGMV expression 
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vector as follows: PCR was perfonned using cDNA clone (graciously provided by Kevin Lynch, 
University ofVirginia Health Sciences Center; the vector utilized was not identified by the source) 
as template and pfii polymerase (Stratagene) with the buffer system provided by the manufacturer 
supplemented with 10% DMSO, 0.25 ^iM of each primer, and 0.5 mM of each of the 4 
5 nucleotides. The cycle condition was 30 cycles of: 94**C for 1 min; 60°C for Imin; and 72 °C for 
1 min and 20 sec. The 5' PCR primer contained an EcoRI site with the sequence: 
5'-CTGGAATTCACCTGGACCACCACCAATGGATA-3' (SEQ.ID.NO.: 75) 
and the 3' primer contained a BamHI site with the sequence 
5'-CTCGGATCCTGCAAAGTTTGTCATACAG TT-3' (SEQ.ID.NO.: 76). 
10 The 1 .2 kb PCR fragment was digested with EcoRI and BamHI and cloned into EcoRI-BamHI 
site of pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 77) and amino acid (SEQ.ID.NO.: 
78) sequences for human EBI2 were thereafter detemiined and verified. 

27. ETBR-LP2 (GenBank Accession Number: D38449) 
The cDNA for human ETBR-LP2 was generated and cloned into pCMV 
15 expression vector as follows: PCR was performed using brain cDNA as template and rTth 
polymerase (Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 \M of each 
primer, and 0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94T for 
1 min; 65*'C for Imin; and 72 *»C for 1.5 min. The 5' PCR contained an EcoRI site with tiie 
sequence: 

20 5'-CTGGAATTCTCCTGCTCATCCAGCCATGCGG -3' (SEQ.ID.NO.: 79) 
and tiie 3' primer contained a BamHI site with the sequence: 
5'.CCTGGATCCCCACCCCTACTGGGGCCTCAG-3' (SEQ.ID.NO.: 80). 
The 1 .5 kb PCR Augment was digested with EcoRI and BamHI and cloned into EcoRI-BamHI 
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site of pCMV expression vector. Nucleic acid (SEQ.DD.NO.: 81) and amino acid 
(SEQ.ID.NO.: 82) sequences for human ETBR-LP2 were thereafter determined and verified. 

28. GHSR (GenBank Accession Number: U60179) 

The cDNA for human GHSR was generated and cloned into pCMV expression 
5 vector as follows: PGR was performed using hippocampus cDNA as template and TaqPlus 
Precision polymerase (Stratagene) with the buffer system provided by the manufacturer, 0.25 \M 
of each primer, and 0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 
94"C for 1 min; 68*C for Imin; and 72 ^'C for 1 min and 10 sec. For first round PGR, the 5' PGR 
primer sequence was: 
10 5*-ATGTGGAAGGGGACGGGGAGGG-3' (SEQ.ID.NO.: 83) 
and the 3' primer sequence was: 

5'-TGATGTATTAATAGTAGATTGT-3' (SEQ.ED,NO.: 84). 

Two microliters of the first round PGR was used as template for the second round PGR where the 
5' primer was kinased with sequence: 
15 5'-TACGATGTGGAAGGGGAGGGGCAGGGAAGAGGGGGGGT-3'(SEQ.ID.NO.:85) 
and the 3' primer contained an EcoRI site with the sequence: 

5*-GGGAATTGATGTATTAATAGTAGATTGTGTGGAGGCGGG-3*(SEQ.ID.NO.:86). 
The 1.1 kb PGR fi-agment was digested with EcoRI and cloned into blunt-EcoRI site of pGMV 
expression vector. Nucleic acid (SEQ.E).NO.: 87) and amino acid (SEQ.E).NO.: 88) sequences 
20 for human GHSR were thereafter determined and verified 

29. GPCR-CNS (GenBank Accession Number: AF017262) 
The cDNA for human GPGR-GNS was generated and cloned into pCMV 
expression vector as follows: PGR was performed using brain cDNA as template and rTth 
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polymerase (Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 of each 
primer, and 0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94**C for 
1 min; eS'^C for Imin; and 72 for 2 min. The 5' PCR primer contained a Hindm site with the 
sequence: 

5 5'"GCAAGCTTGTGCCCTCACCAAGCCATGCGAGCC-3' (SEQ.ID.NO.: 89) 
and the 3' primer contained an EcoRI site with the sequence: 
5^CGGAATTCAGCAATGAGTTCCGACAGAAGC-3' (SEQ.ID,NO.: 90). 
The 1.9 kb PCR j&^gment was digested with HindHI and EcoRI and cloned into Hindm-EcoRI 
site of pCMV expression vector. All nine clones sequenced contained a potential polymorphism 
10 involving a S284C change. Aside from this difference, nucleic acid (SEQ.ID.NO.: 91)andamino 
acid (SEQ.ID.no. : 92) sequences for human GPCR-CNS were thereafter determined and verified. 
30. GPR.NGA(GenBank Accession Number: U55312) 
The cDNA for human GPR-NGA was generated and cloned into pCMV 
expression vector as follows: PCR was performed using genomic DNA as template and rTth 
1 5 polymerase (Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 |iM of each 
primer, and 0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of 94*'C for 
1 min, 56**C for Imin and 72*»C for 1.5 min. The 5' PCR primer contained an EcoRI site with the 
sequence: 

5'-CAGAATTCAGAGAAAAAAAGTGAATATGGTTTTT-3' (SEQ.1D.N0.: 93) 
20 and the 3 * primer contained a BamHI site with the sequence: 

5'.TrGGATCCCTGGTGCATAACAATTGAAAGAAT-3' (SEQ.ID.Na: 94). 

The 1.3 kb PCR fragment was digested with EcoRI and BamHI and cloned into EcoRI-BamHI 

site of pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 95) and amino acid (SEQ.ID.NO.: 
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96) sequences for human GPR-NGA were thereafter determined and verified. 
31. H9 (GenBank Accession Number: U52219) 

The cDNA for human HB954 was generated and cloned into pCMV expression 
vector as follows: PGR was perfomied using pituitary cDNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 ^iM of each primer, and 
0.2 mM of each 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min, 62°C for 
Imin and 72 °C for 2 min. The 5' PGR primer contains a Hindin site with the sequence: 
5'-GGAAAGGTTAAGGATGGGGAGGAGGAAGAT-3' (SEQ.ID.NO.: 97) 
and the 3' primer contains a BamHI site with the sequence: 
5'-GTGGGATGGTAGGAGAGGATTTTTGACAGAG-3' (SEQ.ID.NO.: 98). 
The 1.9 kb PGR fragment was digested with Hindlll and BamHI and cloned into Hindlll- 
BamHI site of pGMV expression vector. When compared to the published sequences, a 
different isoform with 12 bp in frame insertion in the cytoplasmic tail was also identified and 
designated "H9b." Both isoforms contain two potential polymorphisms involving changes 
of amino acid P320S and amino acid G448A. Isoform H9a contained another potential 
polymorphism of amino acid S493N, while isoform H9b contained two additional potential 
polymorphisms involving changes of amino acid I502T and amino acid A532T 
(conresponding to amino acid 528 of isoform H9a). Nucleic acid (SEQ.ID.NO.: 99) and 
amino acid (SEQ.ID.NO.: 100) sequences for human H9 were thereafter determined and 
verified (in the section below, both isofomis were mutated in accordance with the Human 
GPGR Proline Marker Algorithm). 

32. HB954 (GenBank Accession Number: D38449) 

The cDNA for human HB954 was generated and cloned into pCMV expression 
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vector as follows: PCR was perfomied using brain cDNA as template and rXth polymerase (Perkin 
Elnier) with the buffer system provided by the manufacturer, 0.25 |iM and 0.2 mM 

of each of the 4 nucleotides. The cycle condition was 30 cycles of 94**C for 1 min, 58'C for Imin 
and 72 for 2 min. The 5' PCR contained a Hin dTTT site with the sequence: 
5 5'-TCCAAGCTTCGCCATGGGACATAACGGGAGCT -3* (SEQ.K).NO.: 101) 
and the 3' primer contained an EcoRI site with the sequence: 
5^CGTGAATTCCAAGAATTTACAATCCTTGCT.3* (SEQ.ID.NO.: 102). 
The 1.6 kb PCR fragment was digested with Hindlll and EcoRI and cloned into Hindlll- 
EcoRI site of pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 103) and amino acid 
10 (SEQ.ID.NO.: 104) sequences for human HB954 were thereafter determined and verified. 
33. HG38 (GenBank Accession Number: AF062006) 
The cDNA for human HG38 was generated and cloned into pCMV expression 
vector as follows: PCR was perfomied using brain cDNA as template and rTth polymerase (Peiidn 
Ehner) with the buffer system provided by the manufacturer, 0.25 ^iM of each primer, and 0.2 mM 
15 of each 4 nucleotides. The cycle condition was 30 cycles of 94**C for 1 min, 56°C for Imin and 
72 °C for 1 min and 30 sec. Two PCR reactions were perfonned to separately obtain the 5' and 
3 ' fragment. For the 5 * fragment, the 5 ' PCR primer contained an HindlH site with the sequence: 
S'.CCCAAGCTTCGGGCACCATGGACACCTCCC-3' (SEQ.ID.NO.: 259) 
and the 3' primer contained a BamHIsite with the sequence: 
20 5'-ACAGGATCCAAATGCACAGCACTGGTAAGC-3' (SEQ.ID.NO.: 260). 

This 5 * 1 .5 kb PCR fragment was digested with Hindin and BamHI and cloned into an Hindm- 
BamHI site of pCMV. For the 3' fragment, the 5' PCR primer was kinased with the sequence: 
5'-CTATAACTGGGTTACATGGTTTAAC-3* (SEQ.ID.NO. 261) 
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and the 3 ' primer contained an EcoRI site with the sequence: 
5'-TTTGAATTCACATATTAATTAGAGACATGG-3' (SEQ.ID.NO.: 262). 
The 1.4 kb 3' PCR fragment was digested with EcoRI and subcloned into a blunt-EcoRI site of 
pCMV vector. The5' and3' fragments were then ligated together through a common EcoRV site 
5 to generate the full length cDNA clone. Nucleic acid (SEQ.ID.NO.: 263) and amino acid 
(SEQ.ID.no.: 264) sequences for human HG38 were thereafter detemiined and verified. 

34. HM74 (GenBank Accession Number: D10923) 

The cDNA for human HM74 was generated and cloned into pCMV expression 
vector as foUows: PCR was perfomied using either genomic DNA or thymus cDNA (pooled) as 
10 template and rTth polymerase (Perkin Elmer) with the buffer system provided by the 
manufacturer, 0.25 ^M of each primer, and 0.2 mM of each of the 4 nucleotides. The cycle 
condition was 30 cycles of: 94''C for 1 min; 65''C for Imin; and 72 °C for 1 min and 20 sec. ITie 
5' PCR primer contained an EcoRI site with the sequence: 

5'-GGAGAATTCACTAGGCGAGGCGCTCCATC-3' (SEQ.ID.NO.: 105) 
15 and the 3 ' primer was kinased with the sequence: 

5'-GGAGGATCCAGGAAACCTTAGGCCGAGTCC-3'(SEQ.IDJ^O.:106). 
The 1.3 kb PCR fragment was digested with EcoRI and cloned into EcoRI-Smal site of 
pCMV expression vector. Clones sequenced revealed a potential polymoiphism involving a 
N94K change. Aside from this difference, nucleic acid (SEQ.ID.NO.: 107) and amino acid 
20 (SEQ.ID.NO.: 108) sequences for human HM74 were thereafter detennined and verified. 

35. MIG (GenBank Accession Numbers: AFO44600 and AFO44601) 
The cDNA for human MIG was generated and cloned into pCMV expression 
vector as follows: PCR was perfomied using genomic DNA as template and TaqPIus Precision 
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polymerase (Stratagene) for first round PCR orpfu polymerase (Stratagene) for second round PCR 
with the buffer system provided by the manufacturer, 0.25 ^iM of each primer, and 0.2 mM 
(TaqPlus Precision) or 0.5 mM (pfii) of each of the 4 nucleotides. When pfu was used, 10% 
DMSO was included in the buffer. The cycle condition was 30 cycles of 94**C for 1 min; 65°C 
5 for Imin; and 72 for: (a) 1 min for first round PCR; and (b) 2 min for second round PCR. 
Because there is an intron in the coding region, two sets of primers were separately used to 
generate overlapping 5' and 3' fiagments. The 5' fiagment PCR primers were: 

5'-ACCATGGCTTGCAATGGCAGTGCGGCCAGGGGGCACT-3' (external sense) 
(SEQ.ID.no.: 109) 
10 and 

5'-CGACCAGGACAAACAGCATCTTGGTCACTTGTCTCCGGC-3Xintemal antisense) 

(SEQ.rD.NO.:110). 

The 3' fi-agment PCR primers were: 

5'.GACCAAGATGCTGTTTGTCCTGGTCGTGGTGTTTGGCAT-3' (internal sense) 
15 (SEQ.ID.no,: 111) and 

5'-CGGAATTCAGGATGGATCGGTCTCTTGCTGCGCCT-3' (external antisense with an 
EcoRI site) (SEQ.ID.NO.: 1 12). 

The 5* and 3' fiagments were ligated together by using the first round PCR as template and the 
kinased external sense primer and external antisense primer to perform second round PCR. The 
20 1 .2 kb PCR fi^gmenl was digested with EcoRI and cloned into the blunt-EcoRI site of pCMV 
expression vector. Nucleic acid (SEQ.n).NO.: 113) and amino acid (SEQ.1D.N0.: 114) 
sequences for human MIG were thereafter determined and verified., 
36. OGRl (GenBank Accession Number: U4g405) 
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The cDNA for human OGRl was generated and cloned into pCMV expression 
vector as follows: PCR was performed using genomic DNA as template and rTth polymerase 
(Perkin Elmer) with the buffer system provided by the manufacturer, 0.25 of each primer, and 
0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 1 min; 65°C 
5 for Imin; and 72 ""C for 1 min and 20 sec. The 5* PCR primer was kinased with the sequence: 
5'-GGAAGCTTCAGGCCCAAAGATGGGGAACAT-3' (SEQ.ID.NO.: 115): 
and the 3' primer contained a BamHI site with the sequence: 
5'-GTGGATCCACCCGCGGAGGACCCAGGCTAG"3' (SEQ.ID.NO.: 116). 
The 1 . 1 kb PCR fragment was digested with BamHI and cloned into the EcoRV-BamHI site 
10 ofpCMV expression vector. Nucleic acid (SEQ.ID.NO.: 1 17) andamino acid (SEQ.ID.NO.: 
118) sequences for human OGRl were thereafter determined and verified. 
37. Serotonin SWTja 

The cDNA encoding endogenous human SHTja receptorwas obtained by RT-PCR 
using human brain poly-A* RNA; a 5* primer from the 5' untranslated region with an Xho I 

15 restriction site: 

5'.GACCTCGAGTCCTTCTACACCrCATC-3' (SEQ.ID.NO: 119) 

and a 3' primer from the 3* untranslated region containing an Xba I site: 

5'-TGCTCTAGATTCCAGATAGGTGAAAACTTG-3' (SEQ.ID.NO: 120) 

PCR was perfomied using either TaqPlus™ precision polymerase (Stratagene) or rTth™ 

20 polymerase (Perkin Elmer) with the buffer system provided by the manufacturers, 0.25 ^M of each 
primer, and 0.2 mM of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94°C for 
1 min; 57 *»C for Imin; and 72 **C for 2 min. The 1 .5 kb PCR fragment was digested with Xba I 
and subcloned into Eco RV-Xba I site of pBluescript. The resulting cDNA clones were fully 
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sequenced and found to encode two amino acid changes from the published sequences. The jSrst 
one was a T25N mutation in the N-temiinal extracellular domain; the second is an H452Y 
mutation. Because cDNA clones derived from two independent PCR reactions using Taq 
polymerase from two different commercial sources (TaqPlus''^ from Stratagene and rTth*"^ Peridn 
5 Elmer) contained the same two mutations, these mutations are likely to represent sequence 
polymorphisms rather than PCR errors. With these exceptions, the nucleic acid (SEQ.ID.NO.: 
121) and amino acid (SEQ.ID.NO.: 122) sequences for human 5HT2A were thereafter determined 
and verified. 

38. Serotonin SHTjc 

10 The cDNA encoding endogenous human SHTic receptor was obtained from 

- human brain poly-A" RNA by RT-PCR. The 5' and 3* primers were derived from the 5' and 3' 
untranslated regions and contained the following sequences: 
5'-GACCTCGAGGTTGCtTAAGACTGAAGC-3' (SEQ.ID.NO.: 123) 
5'-ATTTCTAGACATATGTAGCTTGTACCG-3' (SEQ.ID.NO.: 124) 

1 5 Nucleic acid (SEQ.ID.NO.: 125) and amino acid (SEQ.ID.NO.: 126) sequences for human SHTzc 
were thereafter determined and verified. 

39. V28 (GenBank Accession Number: U20350) 

The cDNA for human V28 was gen^ated and cloned into pCMV expression 
vector as follows: PCR was performed using brain cDNA as template and rTth polymerase (Perkin 
20 Etaier) with the buffer system provided by the manufacturer, 0.25 pM of each primer, and 0.2 mM 
of each of the 4 nucleotides. The cycle condition was 30 cycles of: 94*C for 1 min; 65°C for Imin; 
and 72 **C for 1 min and 20 sec. The 5' PCR primer contained a HindUI site with the sequence: 
5'-GGTAAGCTTGGCAGTCCACGCCAGGCCTTC-3' (SEQ.ID.NO.: 127) 
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and the 3' primer contained an EcoRI site with the sequence: 
5'-TCCGAATTCTCTGTAGACACAAGGCTTTGG-3' (SEQ.ID.NO.: 128) 
The 1.1 kb PGR fragment was digested with Hindm and EcoRI and cloned into Hindm-EcoRI 
site of pCMV expression vector. Nucleic acid (SEQ.ID.NO.: 129) and amino acid (SEQ.ID.NO.: 
5 130) sequences for human V28 were thereafter detenmined and verified. 



Example 2 

Preparation of Non-Endogenous Human GPCRs 
1 . Site-Directed Mutagenesis 

Mutagenesis based upon the Human GPCR Proline Marker approach disclosed herein was 
10 performed on the foregoing endogenous human GPCRs using Transformer Site-Directed 
Mutagenesis Kit (Clontech) according to the manufacturer instructions. For this mutagenesis 
^proach, a Mutation Probe and a Selection Maiker Probe (unless otherwise indicated, the probe 
of SEQ.ID.NO.: 132 was the same throughout) were utilized, and the sequences of these for the 
• specified sequences are listed below in Table B (the parenthetical number is the SEQ. ID.NO.). 
1 5 For convenience, the codon mutation incorporated mto the human GPCR is also noted, in standard 
form: 



Table B 





Receptor Identifier 
(Codon Mutation) 


Mutation Probe Sequence 
(5'-3') 
(SEQ,ID.NO.) 


Selection Marker Probe 
Sequence (5'-3*) 
(SEQ.ID.Na) 


2( 


iGPRl 
(F245K) 


GATCTCCAGTAGGCATAAGT 

GGACAATTCTGG 

(131) 


CrCCrrCGGTCCTCCrATCGT 

TGTCAGAAG 

(132) 




GPR4 

(K223A) 


AGA/.GGCCAAGATCGCGCGG 

CTGGCCCrCA 

(133) 


CTCCrrCGGTCCrCCTATCGT 
TGTCAGAAGT 


2f 


GPRS 
(V224K) 


CGGCGCCACCGCACGAAAAA 
GCTCATCTTC 


CrCCTTCGGTCCrCCTATCGT 
TGTCAGAAGT 
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(134) 




GPR7 
(T250K) 


GCCAAGAAGCGOGTGAAGTT 

CCTGGTGGTGGCA 

(135) 


CTC(nTCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPRS 
(T259K) 



CAGGCGGAAGGTGAAAGTCC 

TGGTCXnCGT 

(136) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



30GPR9 
(M254K) 



CGGCGCCTGCGGGCCAAGCG 

GCTGGTGGTGGTG 

(137) 



CrcCTrCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPR9-6 
(L241K) 



CCAAGCACAAAGCCAAGAAA 

GTGACCATCAC 

(138) 



CTCCITCGGTCCTCCTATCGT 
TGTCAGAAGT 



3i 



GPRIO 
(F276K) 



GCGCCGGCGCACCAAATGCr 

TGCTGGTGGT 

(139) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPR15 
(I240K) 



GPR17 
(V234K) 



CAAAAAGCTGAAGAAATCTA 
AGAAGATCATCTTTATTGTCG 
(140) 



CAAGACCAAGGCAAAACGCA 

TGATCGCCAT 

(141) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



40GPR18 
(I231K) 



GTCAAGGAGAAGTCCAAAAG 

GATCATCATC 

(142) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPR20 
(M240K) 



CGCCGCGTGCGGGCCAAGCA 

GCTCCTGCTC 

(143) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



4t 



GPR21 
(A251K) 



CCTGATAAGCGCTATAAAAT 

GGTCCTGTTTCGA 

(144) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPR22 
(F312K) 



GAAAGACAAAAGAGAGTCA 
AGAGGATGTCTTTATTG 
(145) 



CTCCrrCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPR24 
(T304K) 



CGGAGAAAGAGGGTGAAAC 

GCACAGCCATCGCC 

(146) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



50 GPR30 
(L258K) 



alternate ^proach; see below 



alternate ^proach; see below 



GPR31 
(Q221K) 



AAGCTTCAGCGGGCCAAGGC 

ACTGGTCACC 

(147) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



5i 



GPR32 
(K255A) 



CATGCCAACCGGCCCGCGAG 

GCTGCTGCTGGT 

(279) 



ACCAGCAGCAGCCTCGCGGG 

CCGGTTGGCATG 

(280) 



GPR40 
(A223K) 



CGGAAGCTGCGGGCCMATG 

GGTGGCCGGC 

(265) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPR41 



CAGAGGAGGGTGAAGGGGCT 
GTTGGCG 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 
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(A223K) 


(266) 




GPR43 
(V221K) 


GGCGGCGCCGAGCCAAGGGG 

CTGGCTGTGG 

(267) 


CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 


APJ 
(L247K) 


alternate approach; see below 


alternate ^proach; see below 


BLRl 
(V258K) 


c:agcggcagaaggcaaaaa 
gggtggccatc 

(148) 


CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 


CEPR 

(U58K) 


CGGCAGAAGGCXjAAGCGCAT 

GATCCTCGCG 

(149) 


CTCCTTCGGTCCrCCTATCGT 
TGTCAGAAGT 



KfEBIl 
(I262K) 



GAGCGCAACAAGGCC AAAA 

AGGTGATCATC 

(150) 



CTCCrrrCGGTCCTCCTATCGT 
TGTCAGAAGT 



EBI2 
(L243K) 



GGTGTAAACAAAAAGGCTAA 
AAACACAATTATTCTTATT 
(151) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



ETBR-LP2 
(N358K) 



GAGAGCCAGCTC AAGA GCAC 

CGTGGTG 

(152) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



GHSR 
(V262K) 



CCACAAGCAAACC AAG AAAA 

TGCTGGCTGT 

(153) . 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



GPCR-CNS 
(N491K) 



CTAGAGAGTCAGATGAAGTG 

TACAGTAGTGGCAC 

(155) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



2() GPR-NGA 

(I275K) 



CGGACAAAAGTGAAAACTAA 

AAAGATGTTCCTCATT 

(156) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



H9aandH9b 
(F236K) 



GCTGAGGTTCGCAATAAACT 

AACCATGTTTGTG 

(157) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



2i 



HB954 
{H265K) 



GGGAGGCCGAGCTGAAAGCC 

ACCCTGCTC 

(158) 



CTCCrTCGGTCCTCCTATCGT 
TGTCAGAAGT 



HG38 
(V765K) 



(KjGACTGCTCTATG AAAA AA 

CACATTGCCCTG 

(268) 



CATCAAGTGTATCATGTGCC 

AAGTACGCCC 

(154) 



HM74 
(I230K) 



CAAGATCAAGAGAGCCAAAA 

CCTTCATCATG 

(159) 



CrCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



3() MIG 
(T273K) 



CCGGAGACAAGTGAAGAAG 

ATGCTGTTrGTC 

(160) 



CTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



OGRl 
(Q227K) 



GCAAGGACCAGATCAAGCGG 

CTGGTGCTCA 

(161) 



CTTCCTTCGGTCCTCCTATCGT 
TGTCAGAAGT 



3^ 



Serotonin SHTja 
(C322K) 



alternate approach; see below 



alternate ^proach; see below 



Serotonin SHTzc 
(S310K) 



alternate approach; see below 



alternate approach; see below 
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V28 


CAAGAAAGCCAAAGCCAAG 


CTCCITCGGTCCrCCTATCGT 


(I230K) 


AAACTGATCCTTCTG 


TGTCAGAAGT 


(362) 





The non-endogenous human GPCRs were then sequenced and the derived and verified nucleic 
acid and amino acid sequences are listed in the accompanying "Sequence Listing" appendix 
5 to this patent document, as summarized in Table C below: 



Table C 



Mutated GPCR 


Nucleic Acid Sequence 
Listing 


Amino Acid Sequence 
Listing 


GPRl 
(F245K) 


SEQ.E>.NO.: 163 


SEQ.E>.N0.: 164 


0 GPR4 
(K223A) 


SEQ.]D.NO.: 165 


SEQ.IDJSrO.: 166 


GPR5 

(V224K) 


SEQ.ID.no.: 167 


SEQ.ID.NO.: 168 


GPR7 
(T250K) 


SEQ.ID.no.: 169 


SEQ.ID.no.: 170 


GPRS 
(T259K) 


SEQ.ID.no.: 171 


SEQ.ID.no.: 172 


GPR9 
(M254K) 


SEQ.ID.NO.: 173 


SEQ.ID.no.: 174 


(1 GPR9-6 
(L241K) 


SEQ.E>.NO.: 175 


SEQ.ID2J0.: 176 


GPRIO 
(F276K) 


SEQ.ID.no.: 177 


SEQ.ID.no.: 178 


GPR15 
i (I240K) 


SEQ.ID.no.: 179 


SEQ.ID.N0.: 180 


GPRl 7 
{V234K) 


SEQ.ID.NO.: 181 


SEQ.ID.no.: 182 


GPRl 8 
(I231K) 


SEQ.E).N0.: 183 


SEQ.ID.no.: 184 


[IGPR20 
(M240K) 


SEQ.E).NO.: 185 


SEQ.K>.N0.: 186 


GPR21 
(A251K) 


SEQ.ID.no.: 187 


SEQ.K).N0.: 188 


GPR22 
(F3I2K) 


SEQ.ID.no.: 189 


SEQ.ID.NO.: 190 


GPR24 
(T304K)) 


SEQ.E>.NO.: 191 


SEQ.ID.no.: 192 


GPR30 


SEQ.ID.no.: 193 


SEQ.ID.N0.: 194 
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{L258K) 






GPR31 


SEQ.ID.no.: 195 


SEQ.K).NO.: 196 


(Q221K) 




GPR32 














SEQ.E).NO.: 271 


SEQ.ID.NO.: 272 






ljrK4 1 


SEQ.n).NO.: 273 


SEQ.ID.no.: 274 








SEQ.ID.no.: 275 


SEQ.ID.no.: 276 






ArJ 


SEQ.ID.no.: 197 


SEQ.ID.no.: 198 








SEQ.ID.NO.: 199 


SEQ.IDJ^0.: 200 






LcrK 


SEQ.ID.NO.: 201 


SEQ.ID.no.: 202 








SEQ.ID.NO.: 203 


SEQ.ID.no.: 204 








SEQ.ID.NO.: 205 


SEQ.ID.no.: 206 






ETBR-LP9 


oIjv.LU.XNU.. zu/ 


^FO TT* TSin • 9ftJl 
oCV^.lI/JNvJ.. L\JO 


(N358K) 




GHSR 


oiiy.iu.lNU.. zuy 




' fV262K> 






cpn rn mh ■ 0 1 1 

oIiy.UJ.lNU.. Zli 


^FO in MO * IM 


(N491K) 






oHy.LU.INU.. aI J 


QFn m \in • '>id 


(I275K) 




1 H9a 


OHV^.UJ.lNU.. Zlj 


^Fn rn Mn "91/=; 


(F236K) 




H9b 


SEQ.ID.no.: 217 


SEQ.ID.no.: 218 


(F236K) 


HB954 


SEQ.ID.NO.: 219 


SEQJD.no.: 220 


(H265K) 






HG38 


SEQ.ID.no.: 277 


SEQ.ID.no.: 278 






HM74 


SEQ.ID.no.: 221 


SEQ.DD.no.: 222 


(I230K) 




^MIG 


SEQ.K>.NO.:223 


SEQ.IDJ^fO.:224 


{T273K) 




(XjRI 


SEQ.ID.no.: 225 


SEQ.ID.no.: 226 


(Q227K) 


Serotonin SHTza 


SEQ.ID.no.: 227 


SEQ.ID.no.: 228 


(C322K) 






Serotonin SHTjc 


SEQ.ro.NO.:229 


SEQ.1D.no.: 230 


(S310K) 






V28 


SEQ.E).NO.:231 


SEQ.ID.NO.: 232 


(I230K) 
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2. Alternate MutatioD Approaches for Employment of the Prolme Marker 
Algorithm: APJ; Serotonin SHTja; Serotonin SHTzc; and GPR30 

Although the above site-directed mutagenesis approach is particularly preferred, other 
approaches can be utilized to create such mutations; those skilled in the art are readily credited 
5 with selecting approaches to mutating a GPCR that fits within the particular needs of the artisan, 
a APJ 

Preparation of the non-endogenous, human APJ receptor was accomplished by 
mutating L247K. Two oligonucleotides containing this mutation were synthesized: 
5'- GGCTTAAGAGCATCATCGTGGTGCTGGTG-3' (SEQ.E).NO.: 233 ) 
10 5*-GTCACCACCAGCACCACGATGATGCTCTTAAGCC-3' (SEQ.ID.NO.: 234) 

The two oligonucleotides were annealed and used to replace the Nael-BstEII fragment of human, 
endogenous APJ to generate the non-endogenous, version of human APJ. 
b. Serotonin SHT^ 
cDNA containing the point mutation C322K was constructed by utilizing the restriction 
1 5 enzyme site Sph I which encompasses amino acid 322. A primer containing the C322K mutation: 
5'-CAAAGAAAGTACTGGGCATCGTCTTCTTCCT-3' (SEQ.ID.NO: 235) 
was used along with the primer from the 3 ' untranslated region of the receptor 
5'-TGCTCTAGATTCCAGATAGGTGAAAA CTTG-3' (SEQ.ID.NO.: 236) 
to perform PGR (under the conditions described above). The resulting PGR fragment was then 
20 used to replace the 3' end of endogenous 5HT2A cDNA through the T4 polymerase blunted Sph 
I site. 

c Serotonin 5HT2c 
The cDNA containing a S3 1 OK mutation was constmcted by replacing the Sty I restriction 
fragment containing amino acid 3 1 0 with synthetic double stranded oligonucleotides that encode 
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the desired mutation. The sense strand sequence utilized had the foUowing sequence: 
5'-CTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTC^ 
(SEQ. JD,NO.: 237) 

and the antisense strand sequence utilized had the following sequence: 
5 S^CAAGGACTTTCTTAGCTTTTCm (SEQ. 
ID. NO.: 238) 

tL GPR30 

Prior to generating non-endogenous GPR30, several independent pCR2.1/GPR30 isolates 
were sequenced in their entirety in order to identify clones with no PCR-generated mutations. A 

1 0 clone having no mutations was digested with EcoRl and the endogenous GPR30 cDNA fragment 
was transfeired into the CMV-driven expression plasmid pCI-neo (Promega), by digesting pCI- 
Neo with EcoRI and subcloning the EcoRI-liberated GPR30 fragment from pCR2.1/GPR30, to 
generate pCI/GPR30. Thereafter, the leucine at codon 258 was mutated to a lysine using a Quick- 
Change^''^ Site-Directed Mutagenesis Kit (Stratagene, #200518), according to manufacturer's 

1 5 instructions, and the following primers: 

5'-CGGCGGCAGAAGGCGAAACGCATGATCCTCGCGGT-3* (SEQ.ID.NO.: 239) and 
5'-ACCGCGAGGATCATGCGTTTCGCCTTCTGC CGCCG-3' (SEQ.ID.NO.: 240) 
Example 3 

Receptor (Endogenous and Mutated) Expression 

20 

Although a variety of cells are available to the art for the expression of proteins, it is most 
preferred that mammalian cells be utilized. The primary reason for this is predicated upon 
practicahties, i.e., utilization of, e.g., yeast cells for the expression of a GPCR, while possible, 
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10 



introduces into the protocol a non-mammalian cell which may not (indeed, in the case of 
yeast, does not) include the receptor-coupling, genetic-mechanism and secretary pathways that 
have evolved for mammalian systems - thus, results obtained in non-mammalian cells, while 
of potential use, are not as preferred as that obtained from mammalian cells. Of the 
5 mammalian cells, COS-7, 293 and 293T cells are particularly preferred, although the specific 
mammalian cell utilized can be predicated upon the particular needs of the artisan. 

Unless otherwise noted herein, the following protocol was utilized for the expression 

oftheendogenous andnon-endogenous human GPCRs.TableDliststhemammalian cell and 
number utilized (per 150mm plate) for GPCR expression. 

Table D 



15 



20 



Receptor Name 


Mammalian CeU 


(Endogenous or Non- 


(Number Utilized) 


Endogenous) 




GPR17 


293(2x 10*) 


GPR30 


293 (4 X 10*) 


APJ 


COS-7 (5X10») 


ETBR-LP2 


293 (1 X lO') 




293T(lxlO') 


GHSR 


293(1 x 10^) 




293T(lx 100 


MIG 


293 (1 x 100 


SoDtoninSHTzA 


2931(1 x 100 


Serotonin 5HT2c 


293T(lxl00 



On day one, mammalian cells were plated out. On day two, two reaction tubes were 
prepared (the proportions to follow for each tube are per plate): tube A was prepai«d by mixing 
20ng DNA {e.g., pCMV vector, pCMV vector with endogenous receptor cDNA, and pCMV 
25 vector with non-endogenous receptor cDNA.) in 1 .2ml seium free DMEM (Irvine Scientific, 
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Irvine, CA); tube B was prepared by mixing 120^1 lipofectamine (Gibco BRL) in 1.2ml serum 
free DMEM. Tubes A and B were then admixed by inversions (several times), followed by 
incubation at room temperature for 30-45min, The admixture is referred to as the "transfection 
mixture". Plated cells were washed with IXPBS, followed by addition of 10ml serum free 
5 DMEM. 2.4ml of the transfection mixture was then added to the cells, followed by incubation 
for 4hrs at 37°C/5% COj. The transfection mixture was then removed by aspiration, followed by 
the addition of 25ml of DMEM/10% Fetal Bovine Serum. Cells were then incubated at 37*a5% 
COj. After 72hr incubation, cells were then harvested and utilized for analysis. 
1. Gi-Coupled Receptors: Co-Transfection with Gs-Coupled Receptors 
10 In the case of GPR30, it has been deteimined that this receptor couples the G protein Gi. 

Gi is known to inhibit the enzyme adenylyl cyclase, which is necessary for catalyzing the 
conversion of ATP to cAMP. Thus, a non-endogenous, constitutively activated form of GPR30 
would be expected to be associated with decreased levels of cAMP. Assay confimiation of a non- 
endogenous, constitutively activated form of GPR3 0 directly via measurement of decreasing levels 
1 5 of cAMP, while viable, can be preferably measured by cooperative use of a Gs-coupled receptor. 
For example, a receptor that is Gs-coupled will stimulate adenylyl cyclase, and thus will be 
associated with an increase in cAMP. The assignee of the present application has discovered that 
the orphan receptor GPR6 is an endogenous, constitutively activated GPCR, GPR6 couples to the 
Gs protein. Thus when co-transfected, one can readily verify that a putative GPR30-mutation 
20 leads to constitutive activation thereof: i.e,, an endogenous, constitutively activated 
GPR6/endogenous, non-constitutively activated GPR30 cell will evidence an elevated level of 
cAMP when compared with an endogenous, constitutively active GPR6/non-endogenous, 
constitutively activated GPR30 (the latter evidencing a comparatively lower level of cAMP), 
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Assays that detect cAMP can be utilized to determine if a candidate compound is eg., an inverse 
agonist to a Gs-associated receptor {/.e., such a compound would decrease the levels of cAMP) or 
a Gi-associated receptor (or a Go-associated receptor) (/.e, such a candidate compound would 
increase the levels of cAMP). A variety of approaches known in the art for measuring cAMP can 
5 be utilized; a preferred ^proach relies upon the use of anti-cAMP antibodies. Another approach, 
and most preferred, utilizes a whole cell second messenger reporter system assay. Promoters on 
genes drive the expression of the proteins that a particular gene encodes. Cyclic AMP drives gene 
expression by promoting the binding of a cAMP-responsi ve DN A binding protein or transcription 
factor (CREB) which then binds to the promoter at specific sites called cAMP response elements 
1 0 and drives the expression of the gene. Reporter systems can be constructed which have a promoter 
containing multiple cAMP response elements before the reporter gene, eg.. P-galactosidase or 
luciferase. Thus, an activated receptor such as GPR6 causes the accumulation of cAMP which then 
activates the gene and expression of the reporter protein. Most preferably, 293 cells are co- 
Iransfected with GPR6 (or another Gs-linked receptor) and GPR30 (or another Gi-linked receptor) 
15 plasmids, preferably in a 1:1 ratio, most preferably in a 1:4 ratio. Because GPR6 is an 
endogenous, constitutively activereceptorthat stimulates theproductionof cAMP, GPR6 strongly 
activates the reporter gene and its expression. The reporter protein such as P-galactosidase or 
luciferase can then be detected using standard biochemical assays (Qien et al. 1995). Co- 
transfection of endogenous, constitutively active GPR6 with endogenous, non-constitutively active 
20 GPR30 evidences an increase in the luciferase reporter protein. Conversely, co-transfection of 
endogenous, constitutively active GPR6 with non-endogenous, constitutively active GPR30 
evidences a drastic decrease in expression of luciferase. Several reporter plasmids are known and 
available in the art for measuring a second messenger assay. It is considered well within the 
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skilled artisan to determine an ^propriate reporter plasmid for a particular gene expression based 
primarily upon the particular need of the artisan. Although a variety of cells are available for 
expression, mammalian cells are most prefened, and of these types, 293 cells are most preferred. 
293 cells were transfected with the reporter plasmid pCRE-Luc/GPR6 and non-endogenous, 
5 constitutively activated GPR30 using a Mammalian Transfection^w Kit (Stratagene, #200285) 
CaP04 precipitation protocol according to the manufacturer's instructions {see, 28 Genomics 347 
( 1 995) for the published endogenous GPR6 sequence). The precipitate contained 400ng reporter, 
80ng CMV-expression plasmid (having a 1 :4 GPR6 to endogenous GPR30 or non-endogenous 
GPR30 ratio) and 20ng CMV-SEAP (a transfection control plasmid encoding secreted alkaline 
10 phosphatase). 50% of the precipitate was split into 3 wells of a 96-well tissue culture dish 
(containing 4X10^ cells/well); the remaining 50% was discarded The following morning, the 
media was changed. 48 hr after the start of the transfection, cells were lysed and examined for 
luciferase activity using a Luclite™ Kit (Packard, Cat. # 601691 1) and Trilux 1450 Microbeta™ 
hquid scintillation and luminescence counter (Wallac) as per the vendor's instructions. The data 
1 5 were analyzed using GraphPad Prism 2.0a (Gr^hPad Software Inc.). 

With respect to GPR17, which has also been determined to be Gi-Iinked, a modification 
of the foregoing approach was utilized, based upon, inter alia, use of another Gs-linked 
endogenous receptor, GPRS (see 23 Genomics 609 (1994) and 24 Genomics 391 (1994)). Most 
preferably, 293 cells are utilized. These cells were plated-out on 96 well plates at a density of 2 
20 x\(f cells per well and were transfected using Lipofectamine Reagent (BRL) the following day 
according to manufecturer instmctions. A DNA/lipid mixture was prepared for each 6-well 
transfection as follows: 260ng of plasmid DNA in 100^1 of DMEM were gently mixed with 2^1 
of lipid in 100^1 of DMEM (the 26Qng of plasmid DNA consisted of 200ng of a 8xCRE-Luc 
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reporterplasmid (^ee below), 50ng ofpCMV comprising endogenous receptor or non-endogenous 
receptor or pCMV alone, and lOng of a GPRS expression plasmid (GPRS in pcDNA3 
(Invitrogen)). The 8XCRE-Luc reporter plasmid was prepared as follows: vector SRDF-P-gal was 
obtained by cloning the rat somatostatin promoter (-71/+51) at BglV-HindHI site in the pPgal- 
5 Basic Vector (Clontech). Eight (8) copies of cAMP response element were obtained by PGR fix)m 
an adenovirus template AdpCF 1 26CCRE8 {see 7 Human Gene Therapy 1883(1 996)) and cloned 
into the SRIF-p-gal vector at the Kpn-BglV site, resulting in the 8xCRE-P-gal reporter vector. The 
8xCRE-Luc reporter plasmid was generated by replacing the beta-galactosidase gene in the 
8xCRE-P-gal reporter vector with the luciferase gene obtained from the pGL3-basic vector 

10 (Promega) at the Hindlll-BamHI site. Following 30min. incubation at room temperature, the 
DNA/lipid mixture was diluted with 400 |il of DMEM and 1 00^1 of the diluted mixture was added 
to each well. 1 00 |il of DMEM with 1 0% FCS were added to each well after a 4hr incubation in 
a cell culture incubator. The next morning the transfected cells were changed with 200 ^1/well of 
DMEM with 10% FCS. Eight (8) hours later, the wells were changed to 100 ^l /well of DMEM 

15 without phenol red, after one wash with PBS. Luciferase activity were measured the next day 
using the LucLite™ reporter gene assay kit (Packard) following manufacturer instructions and read 
on a 1450 MicroBeta*^*^ scintillation and luminescence counter (Wallac). 

Figure 4 evidences that constitutively active GPR30 inhibits GPR6-mediated 
activation of CRE-Luc reporter in 293 cells. Luciferase was measured at about 4.1 relative 

20 light units in the expression vector pCMV. Endogenous GPR30 expressed luciferase at about 
8.5 relative light units, whereas the non-endogenous, constitutively active GPR30 (L258K), 
expressed luciferase at about 3.8 and 3. 1 relative light units, respectively. Co-transfection of 
endogenous GPR6 with endogenous GPR30, at a 1 :4 ratio, drastically increased luciferase 
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expression to about 104.1 relative light units. Co-transfection of endogenous GPR6 with non- 
endogenous GPR30 (L258K), at the same ratio, drastically decreased the expression, which 
is evident at about 18.2 and 29.5 relative light units, respectively. Similar results were 
observed with respect to GPR17 with respect to co-transfection with GPRS, as set forth in 
5 Figure 5. 
Example 3 

Assays For determination of Constitutive Activity 
OF Non-Endogenous GPCRs 

A. Membrane Binding Assays 

10 1. p^SJGTPyS Assay 

When a G protein-coupled receptor is in its active state, either as a result of ligand binding 
or constitutive activation, the receptor couples to a G protein and stimulates the release of GDP 
and subsequent binding ofGTP to the G protein. The alpha subunit of the G protein-receptor 
complex acts as a GTPase and slowly hydrolyzes the GTP to GDP, at which point the receptor 

1 5 normally is deactivated. Constitutively activated receptors continue to exchange GDP for GTP. 
The non-hydrolyzable GTP analog, p^S]GTPyS, can be utilized to demonstrate enhanced binding 
of p^S]GTPyS to membranes expressing constitutively activated receptors. The advantage of 
using p^S]GTPyS binding to measure constitutive activation is that: (a) it is generically applicable 
to all G protein-coupled receptors; (b) it is proximal at the membrane surface making it less likely 

20 to pick-up molecules which affect the intracellular cascade. 

The assay utilizes the ability of G protein coupled receptors to stimulate ["S]GTPyS 
binding to membranes expressing the relevant receptors. The assay can, therefore, be used m 
the direct identification method to screen candidate compounds to known, orphan and 
constitutively activated G protein-coupled receptors. The assay is generic and has application 
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to drug discovery at all G protein-coupled receptors. 

The [^^SJGTPyS assay was incubated in 20 mM HEPES and between 1 and about 20mM MgCl2 
(this amount can be adjusted for optimization of results, although 20mM is preferred) pH 7.4, 
binding buffer with between about 03 and about 1 .2 nM ["S]GTPyS (this amount can be adjusted 
5 for optimization of results, although 1.2 is preferred ) and 12.5 to 75 |ig membrane protein (eg, 
COS-7 cells expressing the receptor; this amount can be adjusted for optimization, although 75ng 
is preferred) and 1 [iM GDP (this amount can be changed for optimization) for 1 hour. 
Wheatgerm agglutinin beads (25 jal; Amersham) were then added and the mixture was incubated 
for another 30 minutes at room temperature. The tubes were then centrifiiged at 1500 x g for 5 
1 0 minutes at room temperature and then counted in a scintillation counter. 

A less costly but equally applicable alternative has been identified which also meets the 
needs of large scale screening. Flash plates^" and Wallac™ scintistrips may be utilized to format 
a high throughput p'S]GTI*yS binding assay. Furthermore, using this technique, the assay can be 
utilized for known GPCRs to simultaneously monitor tritiated Ugand binding to the receptor at the 
1 5 same time as monitoring the efficacy via pS]GTPyS binding. This is possible because the Wallac 
beta counter can switch energy windows to look at both tritium and ^^S-labeled probes. This assay 
may also be used to detect other types of membrane activation events resulting in receptor 
activation. For example, the assay may be used to monitor ^^P phosphorylation of a variety of 
receptors (both G protein coupled and tyrosine kinase receptors). When the membranes are 
20 centrifiiged to die bottom of the well, the bound [^^SJGTPyS or the "P-phosphoiylated receptor 
will activate the scintillant which is coated of the wells, Scinti® strips (Wallac) have been used to 
demonstrate this principle. In addition, the assay also has utility for measuring ligand binding to 
receptors using radioactively labeled ligands. In a similar manner, when the radiolabeled bound 
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ligand is centrifiiged to the bottom of the well, the scintistrip label comes into proximity with the 
radiolabeled ligand resulting in activation and detection. 

Representative results of graph comparing Control (pCMV), Endogenous APJ and Non- 
Endogenous APJ, based upon the foregoing protocol, are set forth in Figure 6. 
5 2. Adenylyl Cyclase 

A Flash PlateTM Adenylyl Cyclase kit (New England Nuclear, Cat. No. SMP604A) 
designed for cell-based assays was modified for use with crude plasma membranes. The Flash 
Plate wells contain a scintillant coating which also contains a specific antibody recognizing cAMP. 
The cAMP generated in the wells was quantitated by a direct competition for binding of 
10 radioactive cAMP tracer to the cAMP antibody. The following serves as a brief protocol for the 
measurement of changes in cAMP levels in membranes that express the receptors. 

Transfected cells were harvested ^^proximately three days after transfection. Membranes 
were prepared by homogenization of suspended cells in buffer containing 20mM HEPES, pH 7.4 
and lOmM MgCl2. Homogenization was performed on ice using a Brinkman Polytron™ for 
15 approximately 10 seconds. The resulting homogenate was centrifiiged at 49,000 X g for 15 
minutes at 4°C. The resulting pellet was then resuspended in buffer containing 20mM HEPES, 
pH 7.4 and 0.1 mM EDTA, homogenized for 10 seconds, followed by centrifiigation at 49,000 X 
g for 15 minutes at 4°C. The resulting pellet can be stored at -80**C until utilized. On the day of 
measurement, the membrane pellet was slowly thawed at room temperature, resuspended in buffer 
20 containing 20mM HEPES, pH 7.4 and 1 OmM MgCLj (these amounts can be optimized, although 
the values listed herein are prefereed), to yield a final protein concentration of 0.60mg/ml (the 
resuspended membranes were placed on ice until use). 

cAMP standards and Detection Buffer (comprising 2 ^Ci of tracer ['"I cAMP (1 00 ^1] to 
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1 1 ml Detection Buffer) were prepared and maintained in accordance with the manufacturer's 
instructions. Assay Buffer was prepared fresh for screening and contained 20mM HEPES, pH 7.4, 
IQmM MgClj, 20mM (Sigma), 0.1 units/ml creatine phosphokinase (Sigma), 50 fiM GTP 
(Sigma), and 0.2 mM ATP (Sigma); Assay Buffer can be stored on ice until utilized. The assay 
was initiated by addition of 50ul of assay buffer followed by addition of 50ul of membrane 
suspension to the MEN Flash Plate. The resultant assay mixture is incubated for 60 minutes at 
room temperature followed by addition of I OOul of detection buffer. Plates are then incubated an 
additional 2-4 hours followed by counting in a Wallac MicroBeta scintillation counter. Values of 
cAMP/well are extrapolated from a standard cAMP curve which is contained witiiin each assay 
plate. The foregoing assay was utilized with respect to analysis of MIG. 
B. Reporter-Based Assays 

1 . CREB Reporter Assay (Gs-associated receptors) 
A method to detect Gs stimulation depends on the known property of the transcription 
factor CREB, which is activated in a cAMP-dependent manner. A PathDetect CREB trans- 
Reporting System (Stratagene, Catalogue #219010) was utilized to assay for Gs coupled 
activity in 293 or 293T cells. Cells were transfected with the plasmids components of this 
above system and the indicated expression plashiid encoding endogenous or mutant receptor 
using a Mammalian Transfection Kit (Stratagene, Catalogue #200285) according to the 
manufacurer's instructions. Briefly, 400 ng pFR-Luc (luciferase reporter plasmid containing 
Gal4 recognition sequences), 40 ng pFA2-CREB (Gal4-CREB fusion protein containing the 
Gal4 DNA-binding domain), 80 ng CMV-receptor expression plasmid (comprising the 
receptor) and 20 ng CM V-SEAP (secreted alkaline phosphatase expression plasmid; alkaline 
phosphatase activity is measured in the media of transfected cells to control for variations in 
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transfection efficiency between samples) were combined in a calcium phosphate precipitate 
as per the Kit's instructions. Half of the precipitate was equally distributed over 3 wells in a 
96-well plate, kept on the cells overnight, and replaced with fresh medium the following 
morning. Forty-eight (48) hr after the start of the transfection, cells were treated and assayed 
5 for luciferase activity as set forth with resepct to the GPR30 system, above. This assay was 
used with respect to GHSR. 

2, API reporter assay (Gq-associated receptors) 
Ae method to detect Gq stimulation depends on the known property of Gq-dependent 
phospholipase C to cause the activation of genes containing API elements in their promoter. 
10 A Pathdetect AP-1 cis-Reporting System (Stratagene, Catalogue # 219073) was utilized 
following the protocl set forth above with respect to the CREB reporter assay, except that the 
components of the calcium phosphate precipitate were 410 ng pAPl-Luc, 80 ng receptor 
expression plasmid, and 20 ng CMV-SEAP, This assay was used with respect to ETBR-LP2 

C Intracellular IP3 Accumulation Assay 
15 On day 1, cells comprising the serotonin receptors (endogenous and mutated) were 

plated onto 24 well plates, usually 1x10^ cells/well. On day 2 cells were transfected by firstly 

mixing 0.25ug DNA in 50 ul serumfree DMEM/well and 2 ul lipofectamine in 50 ^1 

serumfree DMEM/well. The solutions were gently mixed and incubated for 15-30 min at 

room temperature. Cells were washed with 0.5 ml PBS and 400 pi of serum free media was 

20 mixed with the transfection media and added to the cells. The cells were then incubated for 

3-4 hrs at 37°C/5%C02 and then the transfection media was removed and replaced with 

1 ml/well of regular growth media. On day 3 the cells were labeled with ^H-myo-inositol. 

Briefly, the media was removed the cells were washed with 0.5 ml PBS. Then 0.5 ml inositol- 

free/serumfree media ( GBCO BRL) was added/well with 0.25 pCi of ^H-myo-inositol / well 
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and the cells were incubated for 16-18 hrs o/n at 37T/5%C02 • On Day 4 the cells were 
washed with 0.5 ml PBS and 0.45 ml of assay medium was added containing inositol- 
free/serum free media 10 pargyline 10 mM lithium chloride or 0.4 ml of assay medium 
and 50 ul of lOx ketanserin (ket) to final concentration of lO^M. The cells were then 
5 mcubated for 30 min at 37**C. The cells were then washed with 0.5 ml PBSand 200 ul of 
fresh/icecold stop solution (IM KOH; 18 mM Na-borate; 3.8 raM EDTA) was added/well. 
The solution was kept on ice for 5-10 min or until cells were lysed and then neutralized by 
200 lil of fresh/ice cold neutralization sol. (7.5 % HCL). The lysate was then transferred into 
1 .5 ml eppendorf tubes and 1 ml of chloroform/methanol ( 1 :2) was added/tube. The solution 
10 was vortexed for 15 sec and the upper phase was applied to a Biorad AG1-X8 anion 
exchange resin (1 00-200 mesh). Firstly, the resin was washed with water at 1 : 1 .25 WA^ and 
0.9 ml of upper phase was loaded onto the column. The column was washed with 10 mis of 
5 mM myo-inositol and 10 ml of 5 mM Na-borate/60mM Na-formate. The inositol tris 
phosphates were eluted into scintillation vials containing 10 ml of scintillation cocktail with 
15 2 ml of 0.1 M formic acid/ 1 M ammonium formate. The columns were regenerated by 
washing with 1 0 ml of 0.1 M formic acid/3M ammonium formate and rinsed twice with dd 
HjO and stored at 4*^0 in water. 

Figure 7 provides an illustration of IP3 production from the human 5-HT2A receptor 
that incorporates the C322K mutation. While these results evidence that the Proline Mutation 
20 Algorithm approach constitutively activates this receptor, for purposes of using such a 
receptor for screening for identification of potential therapeutics, a more robust difference 
would bepreferred. However, because the activated receptor can be utilized for understanding 
and elucidating the role of constitutive activation and for the identification of compounds that 
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can be further examined, we believe that this difference is itself useful in differentiating 
between the endogenous and non-endogenous versions of the human SHTja receptor. 
D. Result Summary 

The results for the GPCRs tested are set forth in Table E where the Per-Cent Increase 
5 indicates the percentage difference in results observed for the non-endogenous GPCR as compared 
to the endogenous GPCR; these values are followed by parenthetical indications as to the type of 
assay utilized. Additionally, the assay sytem utilized is parenthetically listed (and, in cases where 
different Host Cells were used, both are listed). As these results indicate, a variety of assays can 
beutilized to detemiineconstitutiveactivityofthe non-endogenous versionsofthe human GPCRs. 
1 0 Those skilled in the art, based upon the foregoing and with reference to infomiation available to 
the art, are creditied with theability to selelect and/ot maximize a particular assay approach that 
suites the particualr needs of theinvestigator. 



Table E 



Receptor Identifier 


Per-Cent Difference 


(Codon Mutation) 




GPR17 


74.5 


(V234K) 


(CRE-Luc) 


GPR30 


71.6 


{U58K) 


(CREB) 


APJ 


49.0 


(L247K) 


(GTPyS) 


ETBR-LP2 


48.4(AP1-Luc - 293) 


{N358K) 


61.1(AP1-Luc-293T) 



GHSR 


58.9(CREB-293) 


(V262K) 


35.6(CREB-293T) 
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MIG 


39 (cAMP) 


(I230K) 




Serotonin SHTja 


33.2 (IP3) 


(C322K) 




Serotonin SHTjc 


39.1(IP3) 


{S310K) 





Example 6 

Tissue Distribution of Endogenous Orphan GPCRs 

Using a conunercially available human-tissue dot-blot format, endogenous oiphan GPCRs 
were probed for a determination of the areas where such receptors are localized. Except as indicate 
below, the entire receptor cDNA (radiolabelled) was used as the probe: radiolabeled probe was 
generated using the complete receptor cDNA (excised from the vector) using a Prime-It II™ 
Random Primer Labeling Kit (Stratagene, #300385), according to manufacturer's instructions. 
A human RNA Master Blot™ (Clontech, #7770-1) was hybridized with the GPCR 
radiolabeled probe and washed under stringent conditions according manufacturer's 
instructions. The blot was exposed to Kodak BioMax Autoradiography film overnight at - 
80°C. 

Representative dot-blot fomiat results are presented in Figure 8 for GPRl (8A), GPR30 
(8B), and APJ (8C), with results being summarized for aU receptors in Table F 



Table F 



GPCR 


Tissue Distribution 
(highest levels, relative to other tissues in 
die dot-blot) 


GPRl 


Placenta, Ovary, Adrenal 
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GPR4 


Broad; highest in Heart, Lung, Adrenal, 




Thyroid, Spinal Cord 


GPR5 


Placenta, Thymus, Fetal Thymus 




Lesser levels in spleen, fetal spleen 


GPR7 


Liver, Spleen, Spinal Cord, Placenta 


GPRS 


No expression detected 


GPR9-6 


Thymus, Fetal Thymus 




Lesser levels m Small Intestme 


GPR18 


Spleen, Lymph Node, Fetal Spleen, Testis 


GPR20 


Broad 


GPR21 


Broad; very low abundance 


GPR22 


Heart, Fetal Heart 




Lesser levels in Brain 


GPR30 


Stomach 


GPR31 


Broad 


BLRl 


Spleen 


CEPR 


Stomach, Liver, Thyroid, Putamen 


EBIl 


Pancreas 




Lesser levels in Lymphoid Tissues 


EBI2 


Lymphoid Tissues, Aorta, Lung, Spmal Cord 


ETBR-LP2 


Broad; Brain Tissue 


GPCR-CNS 


Brain 




j^esser leveis in i esus, riaccnia 


GPR-NGA 


Pituitary 




Lesser levels in Brain 


H9 


Pituitary 


HB954 


Aorta, Cerebellum 




Lesser levels in most other tissues 


HM74 


Spleen, Leukocytes, Bone marrow, Mammary 




Glands, Lung, Trachea 


MIG 


Low levels in Kidney, Liver, Pancreas, Lung, 




Spleen 


ORGl 


Pituitary, Stomach, Placenta 


V28 


Brain, Spleen, Peripheral Leukocytes 



25 Based upon the foregoing information, it is noted that human GPCRs can also be assessed 

for distribution in diseased tissue; comparative assessments betweai "normal" and diseased tissue 
can then be utilized to determine the potential for over-expression or under-expression of a 
particular receptor in a diseased state, hi those circumstances where it is desirable to utilize the 
non-endogenous versions of the human GPCRs for the purpose of screening to direcdy idditify 
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candidate compounds of potential therapeutic relevance, it is noted that in veree agonists are useful 
in the treatment of diseases and disorders where a particular human GPCR is over-expressed, 
whereas agonists or partial agonists are useful in the treatment of diseases and disorders where a 
particular human GPCR is under-expressed. 
5 As desired, more detailed, cellular localization of the recepotrs. using techniques well- 

known to those in the art (e,g., in-situ hybridization) can be utiUzed to identify particuafr cells 
within these tissues where the receptor of interest is expressed. 

It is intended that each of the patents, applications, and printed publications mentioned in 
this patent document be hereby incorporated by reference in their entirety. 
1 0 As those skilled in the art will appreciate, numerous changes and modifications may be 

made to the preferred embodiments of the invention without departing ftom the spirit of the 
invention. It is intended that all such variations fall within the scope of the invention. 

Although a variety of expression vectors are available to those in the art, for purposes of 
utilization for both the endogenous and non-endogenous human GPCRs, it is most preferred that 
1 5 the vector utilized be pCMV. This vector has been deposited with the American Type Culture 
Collection (ATCC) on October 13, 1998 (10801 University Blvd., Manassas, VA 201 10-2209 
USA) under the provsions of the Bud^est Treaty for the International Recognition of the Deposit 
of Microorganisms for the Purpose of patent Procedure. The vector was tested by the ATCC on 

, 1998 and determined to be viable on , 1998. The ATCC has assigned 

20 the following deposit number to pCMV: . 
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CLAIMS 

What is claimed is: 

1 . A constitutively active, non-endogenous version of an endogenous human orphan G protein- 
coupled receptor (GPCR) comprising the following amino acid residues (carboxy-terminus to amino- 
terminus orientation) transversing the transmembrane-6 (TM6) and intracellular loop-3 (IC3) regions 
of the non-endogenous GPCR: 

P* AA,5 X 

wherein: 

(1) P' is an amino acid residue located within the TM6 region of the non- 

endogenous GPCR, where P* is selected from the group consisting 
of (i) the endogenous orphan GPCR proline residue, and (ii) a non- 
endogenous amino acid residue other than proline; 

(2) AA„ are 15 amino acid residues selected from the group consisting 
of (a) the 15 endogenous amino acid residues of the endogenous 
orphan GPCR, (b) 15 non-endogenous amino acid residues, and (c) 
a combination of 15 amino acid residues, the combination 
comprising at least one endogenous amino acid residue of the 
endogenous orphan GPCR and at least one non-endogenous amino 
acid residue, excepting that none of the 15 endogenous amino acid 
residues that are positioned within the TM6 region of the GPCR is 
proline; and 

(2) X is a non-endogenous amino acid residue located within the ICS region 
of said non-endogenous GPCR. 

The non-endogenous human GPCR of claim 1 wherein P* is the endogenous proline 
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residue. 

3. The non-endogenous human GPCR of claiml wherein is a non-endogenous amino 
acid residue other than a proline residue. 

4. The non-endogenous human GPCR of claim 1 wherein AA,, are the 15 endogenous 
amino acid residues of the endogenous GPCR. 

5. The non-endogenous human GPCR of claim 1 wherein X is selected from the group 
consisting of lysine, hisitidine, arganine and alanine residues, excepting that when the 
endogenous amino acid in position X of said endogenous human GPCR is lysine, X 
is selected from the group consisting of histidine, arginine and alanine. 

6. The non-endogenous human GPCR of claim 1 wherein X is a lysine residue, excepting 
that when the endogenous ^ino acid in position X of said endogenous human GPCR 
is lysine, X is an amino acid other than lysine. 

7. The non-endogenous human GPCR of claim 4 wherein X is a lysine residue, excepting 
that when the endogenous amino acid in position X of said endogenous human GPCR 
is lysine, X is an amino acid other than lysine. 

8. The non-endogenous, human GPCR of claim 1 wherein P' is a proline residue and X 
is a lysine residue, excepting that when the endogenous amino acid in position X of 
said endogenous human GPCR is lysine, X is an amino acid other than lysine. 

9. A host ceil comprising the non-endogenous human GPCR of claim 1 . 

1 0. The material of claim 9 wherein said host cell is of mammalian origin. 

11. The non-endogenous human GPCR of claim 1 in a purified and isolated form. 

12. A nucleic acid sequence encoding a constitutively active, non-endogenous version of 
an endogenous human oiphan G protein-coupled receptor (GPCR) comprising the following 
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nucleic acid sequence region transversing the transmembrane-6 (TM6) and intracellular loop-3 
(IC3) regions of the orphan GPCR: 

3'-P-^-(AA-codon),3Xcodo„-5' 

wherein: 

5 pcodon jg a nucleic acid encoding region within the TM6 region of the 

non-endogenous GPCR, where P'^"***"" encodes an amino acid selected 
from the group consisting of (i) the endogenous GPCR proline residue, 
and (ii) a non-endogenous amino acid residue other than proline; 
(2) (AA-codon),5are 15 codons encoding 1 5 amino acid residues selected 
from the group consisting of (a) the 15 endogenous amino acid 
residues of the endogenous orphan GPCR, (b) 15 non-endogenous 
amino acid residues, and (c) a combination of 1 5 amino acid residues, 
the combination comprising at least one endogenous amino acid 
residue of the endogenous orphan GPCR and at least one non- 
^5 endogenous amino acid residue, excepting that none of the 15 

endogenous amino acid residues that are positioned within the TM6 , 
region of the orphan GPCR is proline; and 
(3) Xcodon is a nucleic acid encoding region residue located within the ICS 
region of said non-endogenous human GPCR, where X^odon encodes a 
20 non-endogenous amino acid. 

13. The nucleic acid sequence of claim 1 2 wherein P"**" encodes an endogenous proline 
residue. 

14. The nucleic acid sequence of claim 12 wherein P^" encodes a non-endogenous 
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amino acid residue other than a proline residue. 

15. The nucleic acid sequence of claim 12 wherein X^^„ encodes a non-endogenous 
amino acid selected from the group consisting of lysine, histidine, arginine and 
alanine, excepting that when the endogenous amino acid in position X of said 

5 endogenous human GPCR is lysine, X^odon encodes an amino acid selected from the 

group consisiting of histidine, arginine and alanine. 

16. The nucleic acid sequence of claim 13 wherein X«,don encodes a non-endogenous 
lysine amino acid excepting that when the endogenous amino acid in position X of 
said endogenous human GPCR is lysine, X^odon encodes an amino acid selected from 

1 0 the group consisiting of histidine, arginine and alanine. 

17. The nucleic acid sequence of claim 12 wherein X^odon is selected from the group 
consisting of AAA, AAG, GCA, GCG, GCC and GCU. 

18. The nucleic acid sequence of claim 12 wherein X^^don is selected from the group 
consisting of AAA and AAG. 

15 19. The nucleic acid sequence of claim 12 wherein P*^°" is selected from the group 
consisting of CCA, CCC, CCG and CCU, and X^,„ is selected from the group 
consisting of AAA and AAG. 

20. A vector comprising the nucleic acid sequence of claim 12. 

21. A plasmid comprising the nucleic acid sequence of claim 12. 
20 22. A host cell comprising the nucleic acid sequence of claim 2 1 . 

23. The nucleic acid sequence of claim 12 in a purified and isolated form. 

24. A method for selecting for alteration an endogenous amino acid residue within the 
third intracellular loop of a human G protein-coupled receptor ("GPCR"), said receptor 
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comprising a transmembrane 6 region and an intracellular loop 3 region, which endogenous 
amino acid, when altered to a non-endogenous amino acid, constitutively activates said human 
GPCR, comprising the following steps: 

(a) identifying an endogenous proline residue within the transmembrane 6 region 
5 of a human GPCR; 

(b) identifying, by moving in a direction of the carboxy-terminus region of said 
GPCR towards the amino-terminus region of said GPCR, the endogenous, 1 6* 
amino acid residue from said proline residue; 

(c) altering the endogenous residue of step (b) to a non-endogenous amino acid 
1 0 residue to create a non-endogenous version of an endogenous human GPCR; 

and 

(d) determining whether the non-endogenous human GPCR of step (c) is 
constitutively active. 

25 . The method of claim 24 wherein the amino acid residue that is two residues from said 
15 proline residue in the transmembrane 6 region, in a carboxy-terminus to amino- 
terminus direction, is tryptophan. 

26. A constitutively active, non-endogenous human GPCR produced by the process of 
claim 24. 

27. A constitutively active, non-endogenous human GPCR produced by the process of 
20 claim 25. 

28. An algorithmic approach for creating a non-endogenous, constitutively active version 
of an endogenous human G protein coupled receptor (GPCR), said endogenous GPCR 
comprising a transmembrane 6 region and an intracellular loop 3 region, the 
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algorithmic approach comprising the steps of: 

(a) selecting an endogenous human GPCR comprising a proline residue in the 
transmembrane-6 region; 

(b) identifying, by counting 16 amino acid residues from the proline residue of 
step (a), in a carboxy-terminus to amino-terminus direction, an endogenous 
amino acid residue; 

(c) altering the identified amino acid residue of step (b) to a non-endogenous 
amino acid residue to create a non-endogenous version of the endogenous 
human GPCR; and 

(d) determining if the non-endogenous version of the endogenous human GPCR 
of step (c) is constitutively active. 

29. The algorithmic approach of claim 28 v^herein the amino acid residue that is two 
residues from said proline residue in the transmembrane 6 region, in a carboxy- 
terminus to amino-terminus direction, is tryptophan. 

30. A constitutively active, non-endogenous human GPCR produced by the algorithmic 
approach of claim 28. 

31. A constitutively active, non-endogenous human GPCR produced by the algorithmic 
approach of claim 29. 

32. A method for directly identifying a compound selected from the group consisting of 
inverse agonists, agonists and partial agonists to a non-endogenous, constitutively 
activated human G protein coupled receptor, said receptor comprising a 
transmembrane-6 region and an intracellular loop-3 region, comprising the steps of: 
(a) selecting an endogenous human GPCR; 



wo 00/22129 



PCT/US99/23938 



-78- 

(b) identifying a proline residue within the transmembrane-6 region of the GPCR 
of step (a); 

(c) identifying, m a carboxy-terminus to amino-terminus direction, the 
endogenous, 16^ amino acid residue from the proline residue of step (b); 

5 (d) altering the endogenous amino acid of step (c) to a non-endogenous amino 

acid; 

(e) confirming that the non-endogenous GPCR of step (d) is constitutively active; 

(f) contacting a candidate compound with the non-endogenous, constitutively- 
activated GPCR of step (e); and 

(g) determining, by measurement of the compound efficacy at said contacted 
receptor, whether said compound is an mverse agonist, agonist or partial 
agonist of said receptor. 

33, The method of claim 32 wherein the non-endogenous amino acid of step (d) is lysine. 

34. A compound directly identified by the method of claim 32. 

15 35. The method of claim 32 wherein the directly identified compound is an inverse 
agonist. 

36. The method of claim 32 wherein the directly identified compound is an agonist.- 

37. The method of claim 32 wherein the directly identified compound is a partial agonist. 

38. A composition comprising the inverse agonist of claim 35. 
20 39. A composition comprising the agonist of claim 36. 

40. A composition comprising the partial agonist of claim 37. 

41. A method for directly identifying ah inverse agonist to a non-endogenous, 



wo 00/22129 



PCT/US99/23938 



-79- 

constitutively activated human G protein coupled receptor ("GPCR"), said GPCR comprising 
a transmembrane-6 region and an intracellular loop-3 region, comprising the steps of: 

(a) selecting an endogenous human GPCR; 

(b) identifying a proline residue within the transmembrane-6 region of the GPCR of 
step (a); 

(c) identifying, in a carboxy-terminus to amino-terminus direction, the 
endogenous, 16*^ amino acid residue from the proline residue of step (b); 

(d) altering the endogenous amino acid of step (c) to a non-endogenous lysine residue; 

(e) confirming that the non-endogenous GPCR of step (d) is constitutively active; 

(f) contacting a candidate compound with the non-endogenous, constitutively- 
activated GPCR of step (e); and 

(g) determining, by measurement of the compound efficacy at said contacted receptor, 
whether said compound is an inverse agonist of said receptor. 

42. An inverse agonist directly identified by the method of claim 37. 

43. A composition comprising an inverse agonist of claim 38. 
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